Overview
Brought to you by YData
Dataset statistics
| Number of variables | 138 |
|---|---|
| Number of observations | 3814099 |
| Missing cells | 392091022 |
| Missing cells (%) | 74.5% |
| Total size in memory | 3.9 GiB |
| Average record size in memory | 1.1 KiB |
Variable types
| Text | 138 |
|---|
Dataset
| Description | NMNH Extant Specimen Records (USNM, US) 0049395-241126133413365 |
|---|---|
| URL | https://doi.org/10.15468/dl.42mnjx |
datasetName has constant value "NMNH Extant Biology" | Constant |
reproductiveCondition has constant value "Animalia, Chordata, Vertebrata, Amphibia, Anura, Bufonidae" | Constant |
caste has constant value "Animalia" | Constant |
behavior has constant value "Chordata" | Constant |
vitality has constant value "Amphibia" | Constant |
establishmentMeans has constant value "Anura" | Constant |
pathway has constant value "Bufonidae" | Constant |
disposition has constant value "Rhinella" | Constant |
verbatimLabel has constant value "North America, Canada, Nunavut, Baffin Island" | Constant |
materialSampleID has constant value "North America" | Constant |
eventTime has constant value "Nunavut" | Constant |
sampleSizeValue has constant value "1000.0" | Constant |
eventRemarks has constant value "GPS" | Constant |
municipality has constant value "Degrees Minutes Seconds" | Constant |
locationRemarks has constant value "DeFilipps, R. A." | Constant |
georeferenceSources has constant value "29 Mar 1889" | Constant |
latestPeriodOrHighestSystem has constant value "177" | Constant |
bed has constant value "Riccardia pinguis" | Constant |
identificationID has constant value "Guadalupe Island, Baja California." | Constant |
taxonID has constant value "Metzgeriales" | Constant |
parentNameUsageID has constant value "Plantae, Dicotyledonae (basal), Magnoliales, Annonaceae, Annonoideae" | Constant |
originalNameUsageID has constant value "Plantae" | Constant |
taxonConceptID has constant value "Magnoliales" | Constant |
parentNameUsage has constant value "pinguis" | Constant |
namePublishedIn has constant value "Guatteria" | Constant |
namePublishedInYear has constant value "(K. Mert. ex Roth) Derbes & Solier" | Constant |
subfamily has constant value "(Aubl.) R.A. Howard" | Constant |
taxonomicStatus has constant value "Chordata" | Constant |
catalogNumber has 344412 (9.0%) missing values | Missing |
recordNumber has 1688619 (44.3%) missing values | Missing |
recordedBy has 806049 (21.1%) missing values | Missing |
sex has 3118857 (81.8%) missing values | Missing |
lifeStage has 3359653 (88.1%) missing values | Missing |
reproductiveCondition has 3814098 (> 99.9%) missing values | Missing |
caste has 3814098 (> 99.9%) missing values | Missing |
behavior has 3814098 (> 99.9%) missing values | Missing |
vitality has 3814098 (> 99.9%) missing values | Missing |
establishmentMeans has 3814098 (> 99.9%) missing values | Missing |
pathway has 3814098 (> 99.9%) missing values | Missing |
preparations has 1975286 (51.8%) missing values | Missing |
disposition has 3814098 (> 99.9%) missing values | Missing |
associatedMedia has 1396847 (36.6%) missing values | Missing |
associatedSequences has 3809026 (99.9%) missing values | Missing |
occurrenceRemarks has 3306658 (86.7%) missing values | Missing |
organismName has 3814097 (> 99.9%) missing values | Missing |
verbatimLabel has 3814098 (> 99.9%) missing values | Missing |
materialSampleID has 3814098 (> 99.9%) missing values | Missing |
eventType has 3814096 (> 99.9%) missing values | Missing |
fieldNumber has 3496495 (91.7%) missing values | Missing |
eventDate has 653351 (17.1%) missing values | Missing |
eventTime has 3814098 (> 99.9%) missing values | Missing |
startDayOfYear has 806907 (21.2%) missing values | Missing |
endDayOfYear has 805827 (21.1%) missing values | Missing |
year has 653351 (17.1%) missing values | Missing |
month has 799915 (21.0%) missing values | Missing |
day has 1074234 (28.2%) missing values | Missing |
verbatimEventDate has 2027788 (53.2%) missing values | Missing |
habitat has 3516278 (92.2%) missing values | Missing |
sampleSizeValue has 3814098 (> 99.9%) missing values | Missing |
eventRemarks has 3814098 (> 99.9%) missing values | Missing |
locationID has 3366761 (88.3%) missing values | Missing |
higherGeography has 118692 (3.1%) missing values | Missing |
continent has 534327 (14.0%) missing values | Missing |
waterBody has 3107446 (81.5%) missing values | Missing |
islandGroup has 3729526 (97.8%) missing values | Missing |
island has 3560499 (93.4%) missing values | Missing |
country has 160727 (4.2%) missing values | Missing |
stateProvince has 1028496 (27.0%) missing values | Missing |
county has 2948235 (77.3%) missing values | Missing |
municipality has 3814098 (> 99.9%) missing values | Missing |
locality has 544962 (14.3%) missing values | Missing |
verbatimLocality has 3814096 (> 99.9%) missing values | Missing |
minimumElevationInMeters has 2930460 (76.8%) missing values | Missing |
maximumElevationInMeters has 3486461 (91.4%) missing values | Missing |
verbatimElevation has 3703697 (97.1%) missing values | Missing |
minimumDepthInMeters has 3390497 (88.9%) missing values | Missing |
maximumDepthInMeters has 3423246 (89.8%) missing values | Missing |
verbatimDepth has 3790849 (99.4%) missing values | Missing |
locationRemarks has 3814098 (> 99.9%) missing values | Missing |
decimalLatitude has 2665103 (69.9%) missing values | Missing |
decimalLongitude has 2665103 (69.9%) missing values | Missing |
geodeticDatum has 3696977 (96.9%) missing values | Missing |
coordinateUncertaintyInMeters has 3744590 (98.2%) missing values | Missing |
coordinatePrecision has 3814096 (> 99.9%) missing values | Missing |
pointRadiusSpatialFit has 3814095 (> 99.9%) missing values | Missing |
verbatimCoordinates has 3814093 (> 99.9%) missing values | Missing |
verbatimLatitude has 3492892 (91.6%) missing values | Missing |
verbatimLongitude has 3493424 (91.6%) missing values | Missing |
verbatimCoordinateSystem has 3396655 (89.1%) missing values | Missing |
verbatimSRS has 3814097 (> 99.9%) missing values | Missing |
footprintSRS has 3814097 (> 99.9%) missing values | Missing |
footprintSpatialFit has 3814091 (> 99.9%) missing values | Missing |
georeferencedBy has 3814097 (> 99.9%) missing values | Missing |
georeferencedDate has 3814097 (> 99.9%) missing values | Missing |
georeferenceProtocol has 3320409 (87.1%) missing values | Missing |
georeferenceSources has 3814098 (> 99.9%) missing values | Missing |
georeferenceRemarks has 3730205 (97.8%) missing values | Missing |
geologicalContextID has 3814092 (> 99.9%) missing values | Missing |
earliestEonOrLowestEonothem has 3814086 (> 99.9%) missing values | Missing |
latestEonOrHighestEonothem has 3814091 (> 99.9%) missing values | Missing |
earliestEraOrLowestErathem has 3814096 (> 99.9%) missing values | Missing |
latestEraOrHighestErathem has 3814093 (> 99.9%) missing values | Missing |
earliestPeriodOrLowestSystem has 3814085 (> 99.9%) missing values | Missing |
latestPeriodOrHighestSystem has 3814098 (> 99.9%) missing values | Missing |
earliestEpochOrLowestSeries has 3814085 (> 99.9%) missing values | Missing |
latestEpochOrHighestSeries has 3814094 (> 99.9%) missing values | Missing |
earliestAgeOrLowestStage has 3814097 (> 99.9%) missing values | Missing |
latestAgeOrHighestStage has 3814092 (> 99.9%) missing values | Missing |
lowestBiostratigraphicZone has 3814093 (> 99.9%) missing values | Missing |
highestBiostratigraphicZone has 3814097 (> 99.9%) missing values | Missing |
lithostratigraphicTerms has 3814097 (> 99.9%) missing values | Missing |
formation has 3814092 (> 99.9%) missing values | Missing |
member has 3814097 (> 99.9%) missing values | Missing |
bed has 3814098 (> 99.9%) missing values | Missing |
identificationID has 3814098 (> 99.9%) missing values | Missing |
identificationQualifier has 3799723 (99.6%) missing values | Missing |
typeStatus has 3664511 (96.1%) missing values | Missing |
identifiedBy has 3157857 (82.8%) missing values | Missing |
identifiedByID has 3814092 (> 99.9%) missing values | Missing |
dateIdentified has 3814090 (> 99.9%) missing values | Missing |
identificationReferences has 3814093 (> 99.9%) missing values | Missing |
identificationVerificationStatus has 3814095 (> 99.9%) missing values | Missing |
identificationRemarks has 3814095 (> 99.9%) missing values | Missing |
taxonID has 3814098 (> 99.9%) missing values | Missing |
scientificNameID has 3814097 (> 99.9%) missing values | Missing |
acceptedNameUsageID has 3814096 (> 99.9%) missing values | Missing |
parentNameUsageID has 3814098 (> 99.9%) missing values | Missing |
originalNameUsageID has 3814098 (> 99.9%) missing values | Missing |
nameAccordingToID has 3814097 (> 99.9%) missing values | Missing |
namePublishedInID has 3814096 (> 99.9%) missing values | Missing |
taxonConceptID has 3814098 (> 99.9%) missing values | Missing |
scientificName has 152724 (4.0%) missing values | Missing |
acceptedNameUsage has 3814096 (> 99.9%) missing values | Missing |
parentNameUsage has 3814098 (> 99.9%) missing values | Missing |
originalNameUsage has 3814097 (> 99.9%) missing values | Missing |
namePublishedIn has 3814098 (> 99.9%) missing values | Missing |
namePublishedInYear has 3814098 (> 99.9%) missing values | Missing |
phylum has 1562087 (41.0%) missing values | Missing |
class has 102065 (2.7%) missing values | Missing |
order has 410734 (10.8%) missing values | Missing |
family has 101008 (2.6%) missing values | Missing |
subfamily has 3814098 (> 99.9%) missing values | Missing |
genus has 162837 (4.3%) missing values | Missing |
subgenus has 3729484 (97.8%) missing values | Missing |
infragenericEpithet has 3814097 (> 99.9%) missing values | Missing |
specificEpithet has 190700 (5.0%) missing values | Missing |
infraspecificEpithet has 3381784 (88.7%) missing values | Missing |
taxonRank has 3381907 (88.7%) missing values | Missing |
scientificNameAuthorship has 1431500 (37.5%) missing values | Missing |
vernacularName has 3814096 (> 99.9%) missing values | Missing |
nomenclaturalCode has 3814094 (> 99.9%) missing values | Missing |
taxonomicStatus has 3814098 (> 99.9%) missing values | Missing |
nomenclaturalStatus has 3814097 (> 99.9%) missing values | Missing |
taxonRemarks has 3814097 (> 99.9%) missing values | Missing |
gbifID has unique values | Unique |
occurrenceID has unique values | Unique |
Reproduction
| Analysis started | 2025-01-14 16:35:58.342698 |
|---|---|
| Analysis finished | 2025-01-14 16:38:41.454503 |
| Duration | 2 minutes and 43.11 seconds |
| Software version | ydata-profiling vv4.12.1 |
| Download configuration | config.json |
Variables
gbifID
Text
Unique 
| Distinct | 3814099 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 29.1 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Unique
| Unique | 3814099 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 1321585620 |
|---|---|
| 2nd row | 2452323322 |
| 3rd row | 1321585780 |
| 4th row | 1320143695 |
| 5th row | 2397792128 |
| Value | Count | Frequency (%) |
| 1321585620 | 1 | < 0.1% |
| 1321586280 | 1 | < 0.1% |
| 1321587590 | 1 | < 0.1% |
| 1321587488 | 1 | < 0.1% |
| 1320147229 | 1 | < 0.1% |
| 1320145108 | 1 | < 0.1% |
| 1321585780 | 1 | < 0.1% |
| 1320143695 | 1 | < 0.1% |
| 2397792128 | 1 | < 0.1% |
| 1320143630 | 1 | < 0.1% |
| Other values (3814089) | 3814089 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 6765665 | |
| 3 | 5422546 | |
| 2 | 5098023 | |
| 5 | 3086322 | |
| 8 | 3049375 | |
| 7 | 3042848 | |
| 0 | 2990957 | |
| 4 | 2929325 | |
| 6 | 2880749 | |
| 9 | 2875180 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 38140990 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 6765665 | |
| 3 | 5422546 | |
| 2 | 5098023 | |
| 5 | 3086322 | |
| 8 | 3049375 | |
| 7 | 3042848 | |
| 0 | 2990957 | |
| 4 | 2929325 | |
| 6 | 2880749 | |
| 9 | 2875180 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 38140990 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 6765665 | |
| 3 | 5422546 | |
| 2 | 5098023 | |
| 5 | 3086322 | |
| 8 | 3049375 | |
| 7 | 3042848 | |
| 0 | 2990957 | |
| 4 | 2929325 | |
| 6 | 2880749 | |
| 9 | 2875180 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 38140990 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 6765665 | |
| 3 | 5422546 | |
| 2 | 5098023 | |
| 5 | 3086322 | |
| 8 | 3049375 | |
| 7 | 3042848 | |
| 0 | 2990957 | |
| 4 | 2929325 | |
| 6 | 2880749 | |
| 9 | 2875180 |
modified
Text
| Distinct | 286119 |
|---|---|
| Distinct (%) | 7.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 29.1 MiB |
Length
| Max length | 19 |
|---|---|
| Median length | 19 |
| Mean length | 19 |
| Min length | 19 |
Unique
| Unique | 125475 ? |
|---|---|
| Unique (%) | 3.3% |
Sample
| 1st row | 2023-05-10 09:22:00 |
|---|---|
| 2nd row | 2022-01-03 14:31:00 |
| 3rd row | 2022-08-17 11:23:00 |
| 4th row | 2022-12-30 12:34:00 |
| 5th row | 2019-07-10 10:37:00 |
| Value | Count | Frequency (%) |
| 2024-09-25 | 284800 | 3.7% |
| 2022-09-22 | 111266 | 1.5% |
| 2018-09-17 | 106629 | 1.4% |
| 2017-08-04 | 96033 | 1.3% |
| 2022-10-26 | 86404 | 1.1% |
| 2022-08-17 | 68138 | 0.9% |
| 2022-03-25 | 66124 | 0.9% |
| 2022-06-03 | 50303 | 0.7% |
| 2018-10-02 | 48765 | 0.6% |
| 2022-09-08 | 40115 | 0.5% |
| Other values (4857) | 6669621 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 18925013 | |
| 2 | 10663155 | |
| 1 | 9193210 | |
| - | 7628198 | |
| : | 7628198 | |
| 3814099 | 5.3% | |
| 4 | 2499475 | 3.4% |
| 3 | 2486901 | 3.4% |
| 5 | 2329941 | 3.2% |
| 9 | 2243456 | 3.1% |
| Other values (3) | 5056235 | 7.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 53397386 | |
| Dash Punctuation | 7628198 | 10.5% |
| Other Punctuation | 7628198 | 10.5% |
| Space Separator | 3814099 | 5.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 18925013 | |
| 2 | 10663155 | |
| 1 | 9193210 | |
| 4 | 2499475 | 4.7% |
| 3 | 2486901 | 4.7% |
| 5 | 2329941 | 4.4% |
| 9 | 2243456 | 4.2% |
| 8 | 1862086 | 3.5% |
| 7 | 1735441 | 3.3% |
| 6 | 1458708 | 2.7% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 7628198 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 7628198 |
Space Separator
| Value | Count | Frequency (%) |
| 3814099 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 72467881 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 18925013 | |
| 2 | 10663155 | |
| 1 | 9193210 | |
| - | 7628198 | |
| : | 7628198 | |
| 3814099 | 5.3% | |
| 4 | 2499475 | 3.4% |
| 3 | 2486901 | 3.4% |
| 5 | 2329941 | 3.2% |
| 9 | 2243456 | 3.1% |
| Other values (3) | 5056235 | 7.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 72467881 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 18925013 | |
| 2 | 10663155 | |
| 1 | 9193210 | |
| - | 7628198 | |
| : | 7628198 | |
| 3814099 | 5.3% | |
| 4 | 2499475 | 3.4% |
| 3 | 2486901 | 3.4% |
| 5 | 2329941 | 3.2% |
| 9 | 2243456 | 3.1% |
| Other values (3) | 5056235 | 7.0% |
institutionID
Text
| Distinct | 43 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 29.1 MiB |
Length
| Max length | 29 |
|---|---|
| Median length | 29 |
| Mean length | 28.98748223 |
| Min length | 2 |
Unique
| Unique | 10 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | urn:lsid:biocol.org:col:34871 |
|---|---|
| 2nd row | urn:lsid:biocol.org:col:15463 |
| 3rd row | urn:lsid:biocol.org:col:34871 |
| 4th row | urn:lsid:biocol.org:col:34871 |
| 5th row | urn:lsid:biocol.org:col:34871 |
| Value | Count | Frequency (%) |
| urn:lsid:biocol.org:col:34871 | 1956033 | |
| urn:lsid:biocol.org:col:15463 | 1856185 | |
| nsmt | 425 | < 0.1% |
| uam | 339 | < 0.1% |
| rmnh | 146 | < 0.1% |
| nrm | 137 | < 0.1% |
| nmv | 112 | < 0.1% |
| rcs | 95 | < 0.1% |
| nmsz | 77 | < 0.1% |
| zmmu | 70 | < 0.1% |
| Other values (33) | 480 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 15248872 | |
| : | 15248872 | |
| l | 11436654 | 10.3% |
| c | 7624436 | 6.9% |
| i | 7624436 | 6.9% |
| r | 7624436 | 6.9% |
| s | 3812218 | 3.4% |
| d | 3812218 | 3.4% |
| b | 3812218 | 3.4% |
| n | 3812218 | 3.4% |
| Other values (31) | 30504549 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 72432142 | |
| Other Punctuation | 19061090 | 17.2% |
| Decimal Number | 19061090 | 17.2% |
| Uppercase Letter | 6805 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 1841 | |
| N | 1093 | |
| S | 749 | |
| A | 564 | 8.3% |
| U | 496 | 7.3% |
| T | 425 | 6.2% |
| R | 392 | 5.8% |
| H | 233 | 3.4% |
| C | 212 | 3.1% |
| Z | 195 | 2.9% |
| Other values (11) | 605 | 8.9% |
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 15248872 | |
| l | 11436654 | |
| c | 7624436 | |
| i | 7624436 | |
| r | 7624436 | |
| s | 3812218 | 5.3% |
| d | 3812218 | 5.3% |
| b | 3812218 | 5.3% |
| n | 3812218 | 5.3% |
| g | 3812218 | 5.3% |
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 3812218 | |
| 4 | 3812218 | |
| 1 | 3812218 | |
| 8 | 1956033 | |
| 7 | 1956033 | |
| 5 | 1856185 | |
| 6 | 1856185 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 15248872 | |
| . | 3812218 | 20.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 72438947 | |
| Common | 38122180 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 15248872 | |
| l | 11436654 | |
| c | 7624436 | |
| i | 7624436 | |
| r | 7624436 | |
| s | 3812218 | 5.3% |
| d | 3812218 | 5.3% |
| b | 3812218 | 5.3% |
| n | 3812218 | 5.3% |
| g | 3812218 | 5.3% |
| Other values (22) | 3819023 | 5.3% |
Common
| Value | Count | Frequency (%) |
| : | 15248872 | |
| . | 3812218 | 10.0% |
| 3 | 3812218 | 10.0% |
| 4 | 3812218 | 10.0% |
| 1 | 3812218 | 10.0% |
| 8 | 1956033 | 5.1% |
| 7 | 1956033 | 5.1% |
| 5 | 1856185 | 4.9% |
| 6 | 1856185 | 4.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 110561127 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 15248872 | |
| : | 15248872 | |
| l | 11436654 | 10.3% |
| c | 7624436 | 6.9% |
| i | 7624436 | 6.9% |
| r | 7624436 | 6.9% |
| s | 3812218 | 3.4% |
| d | 3812218 | 3.4% |
| b | 3812218 | 3.4% |
| n | 3812218 | 3.4% |
| Other values (31) | 30504549 |
collectionID
Text
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 29.1 MiB |
Length
| Max length | 45 |
|---|---|
| Median length | 45 |
| Mean length | 45 |
| Min length | 45 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | urn:uuid:f14c21a9-8cbf-4c8b-817f-d19d427e2dd6 |
|---|---|
| 2nd row | urn:uuid:60e28f81-e634-4869-aa3e-732caed713c8 |
| 3rd row | urn:uuid:cc104cbf-fd8e-4801-9b71-36731a7db1a0 |
| 4th row | urn:uuid:f14c21a9-8cbf-4c8b-817f-d19d427e2dd6 |
| 5th row | urn:uuid:f14c21a9-8cbf-4c8b-817f-d19d427e2dd6 |
| Value | Count | Frequency (%) |
| urn:uuid:60e28f81-e634-4869-aa3e-732caed713c8 | 1856185 | |
| urn:uuid:f14c21a9-8cbf-4c8b-817f-d19d427e2dd6 | 792284 | |
| urn:uuid:18e3cd08-a962-4f0a-b72c-9a0b3600c5ad | 249390 | 6.5% |
| urn:uuid:59e56a59-8615-4e0c-841d-eb88f3876b22 | 247291 | 6.5% |
| urn:uuid:73d83e23-1999-42cd-b38a-c06a7d32d893 | 240577 | 6.3% |
| urn:uuid:cc104cbf-fd8e-4801-9b71-36731a7db1a0 | 240491 | 6.3% |
| urn:uuid:09c9cf5f-f5d3-48cc-b5c8-cd9b9fbd631f | 187881 | 4.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| - | 15256396 | 8.9% |
| 8 | 13115302 | 7.6% |
| d | 11592424 | 6.8% |
| u | 11442297 | 6.7% |
| 3 | 10471017 | 6.1% |
| e | 9689355 | 5.6% |
| c | 9414596 | 5.5% |
| 1 | 9256391 | 5.4% |
| a | 8567826 | 5.0% |
| 6 | 8270441 | 4.8% |
| Other values (12) | 64558410 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 76347338 | |
| Lowercase Letter | 72402523 | |
| Dash Punctuation | 15256396 | 8.9% |
| Other Punctuation | 7628198 | 4.4% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 8 | 13115302 | |
| 3 | 10471017 | |
| 1 | 9256391 | |
| 6 | 8270441 | |
| 2 | 7804315 | |
| 4 | 7742634 | |
| 7 | 6996246 | |
| 9 | 6388438 | |
| 0 | 4500357 | 5.9% |
| 5 | 1802197 | 2.4% |
Lowercase Letter
| Value | Count | Frequency (%) |
| d | 11592424 | |
| u | 11442297 | |
| e | 9689355 | |
| c | 9414596 | |
| a | 8567826 | |
| f | 6150105 | |
| b | 4103623 | 5.7% |
| r | 3814099 | 5.3% |
| i | 3814099 | 5.3% |
| n | 3814099 | 5.3% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 15256396 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 7628198 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 99231932 | |
| Latin | 72402523 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| - | 15256396 | |
| 8 | 13115302 | |
| 3 | 10471017 | |
| 1 | 9256391 | |
| 6 | 8270441 | |
| 2 | 7804315 | |
| 4 | 7742634 | |
| : | 7628198 | |
| 7 | 6996246 | |
| 9 | 6388438 | |
| Other values (2) | 6302554 |
Latin
| Value | Count | Frequency (%) |
| d | 11592424 | |
| u | 11442297 | |
| e | 9689355 | |
| c | 9414596 | |
| a | 8567826 | |
| f | 6150105 | |
| b | 4103623 | 5.7% |
| r | 3814099 | 5.3% |
| i | 3814099 | 5.3% |
| n | 3814099 | 5.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 171634455 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| - | 15256396 | 8.9% |
| 8 | 13115302 | 7.6% |
| d | 11592424 | 6.8% |
| u | 11442297 | 6.7% |
| 3 | 10471017 | 6.1% |
| e | 9689355 | 5.6% |
| c | 9414596 | 5.5% |
| 1 | 9256391 | 5.4% |
| a | 8567826 | 5.0% |
| 6 | 8270441 | 4.8% |
| Other values (12) | 64558410 |
institutionCode
Text
| Distinct | 43 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 29.1 MiB |
Length
| Max length | 6 |
|---|---|
| Median length | 4 |
| Mean length | 3.026483319 |
| Min length | 2 |
Unique
| Unique | 10 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | USNM |
|---|---|
| 2nd row | US |
| 3rd row | USNM |
| 4th row | USNM |
| 5th row | USNM |
| Value | Count | Frequency (%) |
| usnm | 1956033 | |
| us | 1856185 | |
| nsmt | 425 | < 0.1% |
| uam | 339 | < 0.1% |
| rmnh | 146 | < 0.1% |
| nrm | 137 | < 0.1% |
| nmv | 112 | < 0.1% |
| rcs | 95 | < 0.1% |
| nmsz | 77 | < 0.1% |
| zmmu | 70 | < 0.1% |
| Other values (33) | 480 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| S | 3812967 | |
| U | 3812714 | |
| M | 1957874 | |
| N | 1957126 | |
| A | 564 | < 0.1% |
| T | 425 | < 0.1% |
| R | 392 | < 0.1% |
| H | 233 | < 0.1% |
| C | 212 | < 0.1% |
| Z | 195 | < 0.1% |
| Other values (11) | 605 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 11543307 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 3812967 | |
| U | 3812714 | |
| M | 1957874 | |
| N | 1957126 | |
| A | 564 | < 0.1% |
| T | 425 | < 0.1% |
| R | 392 | < 0.1% |
| H | 233 | < 0.1% |
| C | 212 | < 0.1% |
| Z | 195 | < 0.1% |
| Other values (11) | 605 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 11543307 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| S | 3812967 | |
| U | 3812714 | |
| M | 1957874 | |
| N | 1957126 | |
| A | 564 | < 0.1% |
| T | 425 | < 0.1% |
| R | 392 | < 0.1% |
| H | 233 | < 0.1% |
| C | 212 | < 0.1% |
| Z | 195 | < 0.1% |
| Other values (11) | 605 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 11543307 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| S | 3812967 | |
| U | 3812714 | |
| M | 1957874 | |
| N | 1957126 | |
| A | 564 | < 0.1% |
| T | 425 | < 0.1% |
| R | 392 | < 0.1% |
| H | 233 | < 0.1% |
| C | 212 | < 0.1% |
| Z | 195 | < 0.1% |
| Other values (11) | 605 | < 0.1% |
collectionCode
Text
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 29.1 MiB |
Length
| Max length | 5 |
|---|---|
| Median length | 2 |
| Mean length | 2.608911043 |
| Min length | 2 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | IZ |
|---|---|
| 2nd row | US |
| 3rd row | HERP |
| 4th row | IZ |
| 5th row | IZ |
| Value | Count | Frequency (%) |
| us | 1856185 | |
| iz | 792284 | |
| ent | 249390 | 6.5% |
| mamm | 247291 | 6.5% |
| birds | 240577 | 6.3% |
| herp | 240491 | 6.3% |
| fish | 187881 | 4.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| S | 2284643 | |
| U | 1856185 | |
| I | 1220742 | |
| Z | 792284 | 8.0% |
| M | 741873 | 7.5% |
| E | 489881 | 4.9% |
| R | 481068 | 4.8% |
| H | 428372 | 4.3% |
| N | 249390 | 2.5% |
| T | 249390 | 2.5% |
| Other values (5) | 1156817 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 9950645 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 2284643 | |
| U | 1856185 | |
| I | 1220742 | |
| Z | 792284 | 8.0% |
| M | 741873 | 7.5% |
| E | 489881 | 4.9% |
| R | 481068 | 4.8% |
| H | 428372 | 4.3% |
| N | 249390 | 2.5% |
| T | 249390 | 2.5% |
| Other values (5) | 1156817 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 9950645 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| S | 2284643 | |
| U | 1856185 | |
| I | 1220742 | |
| Z | 792284 | 8.0% |
| M | 741873 | 7.5% |
| E | 489881 | 4.9% |
| R | 481068 | 4.8% |
| H | 428372 | 4.3% |
| N | 249390 | 2.5% |
| T | 249390 | 2.5% |
| Other values (5) | 1156817 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9950645 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| S | 2284643 | |
| U | 1856185 | |
| I | 1220742 | |
| Z | 792284 | 8.0% |
| M | 741873 | 7.5% |
| E | 489881 | 4.9% |
| R | 481068 | 4.8% |
| H | 428372 | 4.3% |
| N | 249390 | 2.5% |
| T | 249390 | 2.5% |
| Other values (5) | 1156817 |
datasetName
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 29.1 MiB |
Length
| Max length | 19 |
|---|---|
| Median length | 19 |
| Mean length | 19 |
| Min length | 19 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | NMNH Extant Biology |
|---|---|
| 2nd row | NMNH Extant Biology |
| 3rd row | NMNH Extant Biology |
| 4th row | NMNH Extant Biology |
| 5th row | NMNH Extant Biology |
| Value | Count | Frequency (%) |
| nmnh | 3814099 | |
| extant | 3814099 | |
| biology | 3814099 |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 7628198 | 10.5% |
| 7628198 | 10.5% | |
| t | 7628198 | 10.5% |
| o | 7628198 | 10.5% |
| M | 3814099 | 5.3% |
| H | 3814099 | 5.3% |
| E | 3814099 | 5.3% |
| x | 3814099 | 5.3% |
| a | 3814099 | 5.3% |
| n | 3814099 | 5.3% |
| Other values (5) | 19070495 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 41955089 | |
| Uppercase Letter | 22884594 | |
| Space Separator | 7628198 | 10.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 7628198 | |
| o | 7628198 | |
| x | 3814099 | |
| a | 3814099 | |
| n | 3814099 | |
| i | 3814099 | |
| l | 3814099 | |
| g | 3814099 | |
| y | 3814099 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 7628198 | |
| M | 3814099 | |
| H | 3814099 | |
| E | 3814099 | |
| B | 3814099 |
Space Separator
| Value | Count | Frequency (%) |
| 7628198 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 64839683 | |
| Common | 7628198 | 10.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 7628198 | |
| t | 7628198 | |
| o | 7628198 | |
| M | 3814099 | 5.9% |
| H | 3814099 | 5.9% |
| E | 3814099 | 5.9% |
| x | 3814099 | 5.9% |
| a | 3814099 | 5.9% |
| n | 3814099 | 5.9% |
| B | 3814099 | 5.9% |
| Other values (4) | 15256396 |
Common
| Value | Count | Frequency (%) |
| 7628198 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 72467881 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| N | 7628198 | 10.5% |
| 7628198 | 10.5% | |
| t | 7628198 | 10.5% |
| o | 7628198 | 10.5% |
| M | 3814099 | 5.3% |
| H | 3814099 | 5.3% |
| E | 3814099 | 5.3% |
| x | 3814099 | 5.3% |
| a | 3814099 | 5.3% |
| n | 3814099 | 5.3% |
| Other values (5) | 19070495 |
basisOfRecord
Text
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 29.1 MiB |
Length
| Max length | 18 |
|---|---|
| Median length | 17 |
| Mean length | 17.0061076 |
| Min length | 16 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | PreservedSpecimen |
|---|---|
| 2nd row | PreservedSpecimen |
| 3rd row | PreservedSpecimen |
| 4th row | PreservedSpecimen |
| 5th row | PreservedSpecimen |
| Value | Count | Frequency (%) |
| preservedspecimen | 3763178 | |
| machineobservation | 37108 | 1.0% |
| humanobservation | 13813 | 0.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 18903919 | |
| r | 7577277 | |
| n | 3865020 | 6.0% |
| i | 3851207 | 5.9% |
| s | 3814099 | 5.9% |
| v | 3814099 | 5.9% |
| c | 3800286 | 5.9% |
| m | 3776991 | 5.8% |
| P | 3763178 | 5.8% |
| p | 3763178 | 5.8% |
| Other values (11) | 7933724 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 57234780 | |
| Uppercase Letter | 7628198 | 11.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 18903919 | |
| r | 7577277 | |
| n | 3865020 | 6.8% |
| i | 3851207 | 6.7% |
| s | 3814099 | 6.7% |
| v | 3814099 | 6.7% |
| c | 3800286 | 6.6% |
| m | 3776991 | 6.6% |
| p | 3763178 | 6.6% |
| d | 3763178 | 6.6% |
| Other values (6) | 305526 | 0.5% |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 3763178 | |
| S | 3763178 | |
| O | 50921 | 0.7% |
| M | 37108 | 0.5% |
| H | 13813 | 0.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 64862978 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 18903919 | |
| r | 7577277 | |
| n | 3865020 | 6.0% |
| i | 3851207 | 5.9% |
| s | 3814099 | 5.9% |
| v | 3814099 | 5.9% |
| c | 3800286 | 5.9% |
| m | 3776991 | 5.8% |
| P | 3763178 | 5.8% |
| p | 3763178 | 5.8% |
| Other values (11) | 7933724 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 64862978 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 18903919 | |
| r | 7577277 | |
| n | 3865020 | 6.0% |
| i | 3851207 | 5.9% |
| s | 3814099 | 5.9% |
| v | 3814099 | 5.9% |
| c | 3800286 | 5.9% |
| m | 3776991 | 5.8% |
| P | 3763178 | 5.8% |
| p | 3763178 | 5.8% |
| Other values (11) | 7933724 |
occurrenceID
Text
Unique 
| Distinct | 3814099 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 29.1 MiB |
Length
| Max length | 63 |
|---|---|
| Median length | 63 |
| Mean length | 63 |
| Min length | 63 |
Unique
| Unique | 3814099 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | http://n2t.net/ark:/65665/3c1d5cd1b-23f9-4aab-8cd8-011e6535be18 |
|---|---|
| 2nd row | http://n2t.net/ark:/65665/38212d138-cfcd-4363-8d3b-93b82afc1d4b |
| 3rd row | http://n2t.net/ark:/65665/3c1d69371-acc7-4c47-bc57-9d5ba7994267 |
| 4th row | http://n2t.net/ark:/65665/382140f93-30c1-4f26-bd0c-77d197d5ebc0 |
| 5th row | http://n2t.net/ark:/65665/3c1d814f8-bb57-4c37-a953-dd84b1c6415d |
| Value | Count | Frequency (%) |
| http://n2t.net/ark:/65665/3c1d5cd1b-23f9-4aab-8cd8-011e6535be18 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/3c1dca2ff-5a4b-407d-be1e-8c2465e2dbc4 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/3c1eb4e39-5ffd-4448-b2ce-395313b0c10e | 1 | < 0.1% |
| http://n2t.net/ark:/65665/3c1ea3d60-dba8-415e-80b6-a9bc8c946ff8 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/3823c9e76-01df-419c-aa05-a6aec0f69473 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/382257af5-f81d-4f8a-aff0-9f0f328b0fdb | 1 | < 0.1% |
| http://n2t.net/ark:/65665/3c1d69371-acc7-4c47-bc57-9d5ba7994267 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/382140f93-30c1-4f26-bd0c-77d197d5ebc0 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/3c1d814f8-bb57-4c37-a953-dd84b1c6415d | 1 | < 0.1% |
| http://n2t.net/ark:/65665/38215186e-af4f-46dc-8b81-ec58617bdfd7 | 1 | < 0.1% |
| Other values (3814089) | 3814089 |
Most occurring characters
| Value | Count | Frequency (%) |
| / | 19070495 | 7.9% |
| 6 | 18595063 | 7.7% |
| - | 15256396 | 6.3% |
| t | 15256396 | 6.3% |
| 5 | 14773602 | 6.1% |
| a | 11917774 | 5.0% |
| 2 | 10968294 | 4.6% |
| e | 10967918 | 4.6% |
| 3 | 10962313 | 4.6% |
| 4 | 10959604 | 4.6% |
| Other values (16) | 101560382 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 103921686 | |
| Lowercase Letter | 90597363 | |
| Other Punctuation | 30512792 | 12.7% |
| Dash Punctuation | 15256396 | 6.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 15256396 | |
| a | 11917774 | |
| e | 10967918 | |
| b | 8105562 | |
| n | 7628198 | |
| c | 7157232 | |
| d | 7155364 | |
| f | 7152523 | |
| k | 3814099 | 4.2% |
| r | 3814099 | 4.2% |
| Other values (2) | 7628198 |
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 18595063 | |
| 5 | 14773602 | |
| 2 | 10968294 | |
| 3 | 10962313 | |
| 4 | 10959604 | |
| 9 | 8108056 | |
| 8 | 8105434 | |
| 1 | 7152772 | 6.9% |
| 0 | 7148472 | 6.9% |
| 7 | 7148076 | 6.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 19070495 | |
| : | 7628198 | 25.0% |
| . | 3814099 | 12.5% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 15256396 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 149690874 | |
| Latin | 90597363 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| / | 19070495 | |
| 6 | 18595063 | |
| - | 15256396 | |
| 5 | 14773602 | |
| 2 | 10968294 | |
| 3 | 10962313 | |
| 4 | 10959604 | |
| 9 | 8108056 | 5.4% |
| 8 | 8105434 | 5.4% |
| : | 7628198 | 5.1% |
| Other values (4) | 25263419 |
Latin
| Value | Count | Frequency (%) |
| t | 15256396 | |
| a | 11917774 | |
| e | 10967918 | |
| b | 8105562 | |
| n | 7628198 | |
| c | 7157232 | |
| d | 7155364 | |
| f | 7152523 | |
| k | 3814099 | 4.2% |
| r | 3814099 | 4.2% |
| Other values (2) | 7628198 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 240288237 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| / | 19070495 | 7.9% |
| 6 | 18595063 | 7.7% |
| - | 15256396 | 6.3% |
| t | 15256396 | 6.3% |
| 5 | 14773602 | 6.1% |
| a | 11917774 | 5.0% |
| 2 | 10968294 | 4.6% |
| e | 10967918 | 4.6% |
| 3 | 10962313 | 4.6% |
| 4 | 10959604 | 4.6% |
| Other values (16) | 101560382 |
catalogNumber
Text
Missing 
| Distinct | 2680425 |
|---|---|
| Distinct (%) | 77.3% |
| Missing | 344412 |
| Missing (%) | 9.0% |
| Memory size | 29.1 MiB |
Length
| Max length | 22 |
|---|---|
| Median length | 21 |
| Mean length | 10.5415526 |
| Min length | 4 |
Unique
| Unique | 2220561 ? |
|---|---|
| Unique (%) | 64.0% |
Sample
| 1st row | USNM 1220020 |
|---|---|
| 2nd row | US 2327562 |
| 3rd row | USNM 359728 |
| 4th row | USNM 65866 |
| 5th row | USNM 1569732 |
| Value | Count | Frequency (%) |
| usnm | 1706642 | |
| us | 1589980 | |
| herp | 2389 | < 0.1% |
| tissue | 2336 | < 0.1% |
| sem | 97 | < 0.1% |
| 69 | < 0.1% | |
| 1 | 61 | < 0.1% |
| stub | 57 | < 0.1% |
| image | 53 | < 0.1% |
| micrograph | 40 | < 0.1% |
| Other values (2298683) | 3469782 |
Most occurring characters
| Value | Count | Frequency (%) |
| S | 3476269 | 9.5% |
| U | 3468327 | 9.5% |
| 3301819 | 9.0% | |
| 1 | 2915757 | 8.0% |
| 2 | 2640645 | 7.2% |
| 3 | 2480659 | 6.8% |
| 0 | 2116762 | 5.8% |
| 4 | 2111189 | 5.8% |
| 5 | 2073315 | 5.7% |
| N | 2019199 | 5.5% |
| Other values (59) | 9971947 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 21938221 | |
| Uppercase Letter | 11281252 | |
| Space Separator | 3301819 | 9.0% |
| Lowercase Letter | 42269 | 0.1% |
| Dash Punctuation | 9275 | < 0.1% |
| Other Punctuation | 3024 | < 0.1% |
| Close Punctuation | 14 | < 0.1% |
| Open Punctuation | 14 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| w | 17734 | |
| e | 4857 | 11.5% |
| s | 4672 | 11.1% |
| a | 3519 | 8.3% |
| r | 2470 | 5.8% |
| p | 2432 | 5.8% |
| u | 2407 | 5.7% |
| i | 2385 | 5.6% |
| b | 800 | 1.9% |
| c | 291 | 0.7% |
| Other values (16) | 702 | 1.7% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 3476269 | |
| U | 3468327 | |
| N | 2019199 | |
| M | 1873513 | |
| E | 181447 | 1.6% |
| T | 162747 | 1.4% |
| A | 27947 | 0.2% |
| D | 27138 | 0.2% |
| R | 18007 | 0.2% |
| B | 14421 | 0.1% |
| Other values (15) | 12237 | 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 2915757 | |
| 2 | 2640645 | |
| 3 | 2480659 | |
| 0 | 2116762 | |
| 4 | 2111189 | |
| 5 | 2073315 | |
| 6 | 1957916 | |
| 7 | 1917347 | |
| 8 | 1892550 | |
| 9 | 1832081 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1723 | |
| * | 1295 | |
| ? | 5 | 0.2% |
| ' | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 3301819 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 9275 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 14 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 14 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 25252367 | |
| Latin | 11323521 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| S | 3476269 | |
| U | 3468327 | |
| N | 2019199 | |
| M | 1873513 | |
| E | 181447 | 1.6% |
| T | 162747 | 1.4% |
| A | 27947 | 0.2% |
| D | 27138 | 0.2% |
| R | 18007 | 0.2% |
| w | 17734 | 0.2% |
| Other values (41) | 51193 | 0.5% |
Common
| Value | Count | Frequency (%) |
| 3301819 | ||
| 1 | 2915757 | |
| 2 | 2640645 | |
| 3 | 2480659 | |
| 0 | 2116762 | |
| 4 | 2111189 | |
| 5 | 2073315 | |
| 6 | 1957916 | |
| 7 | 1917347 | |
| 8 | 1892550 | |
| Other values (8) | 1844408 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 36575888 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| S | 3476269 | 9.5% |
| U | 3468327 | 9.5% |
| 3301819 | 9.0% | |
| 1 | 2915757 | 8.0% |
| 2 | 2640645 | 7.2% |
| 3 | 2480659 | 6.8% |
| 0 | 2116762 | 5.8% |
| 4 | 2111189 | 5.8% |
| 5 | 2073315 | 5.7% |
| N | 2019199 | 5.5% |
| Other values (59) | 9971947 |
recordNumber
Text
Missing 
| Distinct | 368960 |
|---|---|
| Distinct (%) | 17.4% |
| Missing | 1688619 |
| Missing (%) | 44.3% |
| Memory size | 29.1 MiB |
Length
| Max length | 93 |
|---|---|
| Median length | 90 |
| Mean length | 4.785350133 |
| Min length | 1 |
Unique
| Unique | 292732 ? |
|---|---|
| Unique (%) | 13.8% |
Sample
| 1st row | 5209 |
|---|---|
| 2nd row | USNPC # 008843 |
| 3rd row | USNPC # 074963 |
| 4th row | 478 |
| 5th row | s.n. |
| Value | Count | Frequency (%) |
| s.n | 264664 | 11.1% |
| 41913 | 1.8% | |
| usnpc | 36535 | 1.5% |
| no | 19873 | 0.8% |
| number | 19484 | 0.8% |
| bureau | 8434 | 0.4% |
| eyd | 6470 | 0.3% |
| s | 5865 | 0.2% |
| n | 5665 | 0.2% |
| of | 5647 | 0.2% |
| Other values (270340) | 1961294 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 1157151 | |
| 2 | 898438 | 8.8% |
| 3 | 773609 | 7.6% |
| 0 | 742907 | 7.3% |
| 4 | 724395 | 7.1% |
| 5 | 695725 | 6.8% |
| 6 | 673176 | 6.6% |
| 7 | 636233 | 6.3% |
| 8 | 610720 | 6.0% |
| 9 | 594451 | 5.8% |
| Other values (102) | 2664361 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 7506805 | |
| Lowercase Letter | 877335 | 8.6% |
| Uppercase Letter | 735091 | 7.2% |
| Other Punctuation | 641780 | 6.3% |
| Space Separator | 250364 | 2.5% |
| Dash Punctuation | 145228 | 1.4% |
| Connector Punctuation | 6142 | 0.1% |
| Close Punctuation | 3742 | < 0.1% |
| Open Punctuation | 3741 | < 0.1% |
| Other Number | 651 | < 0.1% |
| Other values (4) | 287 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 283424 | |
| s | 277432 | |
| e | 48508 | 5.5% |
| u | 39655 | 4.5% |
| r | 39309 | 4.5% |
| o | 35505 | 4.0% |
| a | 30735 | 3.5% |
| b | 29074 | 3.3% |
| m | 21508 | 2.5% |
| c | 16556 | 1.9% |
| Other values (26) | 55629 | 6.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 95367 | |
| S | 74319 | 10.1% |
| C | 61194 | 8.3% |
| P | 58289 | 7.9% |
| U | 43654 | 5.9% |
| B | 39926 | 5.4% |
| A | 38358 | 5.2% |
| H | 32338 | 4.4% |
| D | 30688 | 4.2% |
| L | 29714 | 4.0% |
| Other values (19) | 231244 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 560765 | |
| # | 36852 | 5.7% |
| / | 19492 | 3.0% |
| & | 10338 | 1.6% |
| * | 5808 | 0.9% |
| ? | 4401 | 0.7% |
| , | 2500 | 0.4% |
| ! | 976 | 0.2% |
| : | 367 | 0.1% |
| ; | 177 | < 0.1% |
| Other values (5) | 104 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1157151 | |
| 2 | 898438 | |
| 3 | 773609 | |
| 0 | 742907 | |
| 4 | 724395 | |
| 5 | 695725 | |
| 6 | 673176 | |
| 7 | 636233 | |
| 8 | 610720 | |
| 9 | 594451 |
Other Number
| Value | Count | Frequency (%) |
| ½ | 622 | |
| ² | 10 | 1.5% |
| ¼ | 9 | 1.4% |
| ¾ | 4 | 0.6% |
| ³ | 3 | 0.5% |
| ⅓ | 3 | 0.5% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 3462 | |
| ] | 180 | 4.8% |
| } | 100 | 2.7% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 3461 | |
| [ | 180 | 4.8% |
| { | 100 | 2.7% |
Math Symbol
| Value | Count | Frequency (%) |
| = | 214 | |
| + | 66 | 23.4% |
| ~ | 2 | 0.7% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 145227 | |
| – | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 250364 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 6142 |
Currency Symbol
| Value | Count | Frequency (%) |
| ¢ | 3 |
Other Symbol
| Value | Count | Frequency (%) |
| ° | 1 |
Final Punctuation
| Value | Count | Frequency (%) |
| › | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 8558740 | |
| Latin | 1612425 | 15.9% |
| Greek | 1 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 283424 | |
| s | 277432 | |
| N | 95367 | 5.9% |
| S | 74319 | 4.6% |
| C | 61194 | 3.8% |
| P | 58289 | 3.6% |
| e | 48508 | 3.0% |
| U | 43654 | 2.7% |
| B | 39926 | 2.5% |
| u | 39655 | 2.5% |
| Other values (54) | 590657 |
Common
| Value | Count | Frequency (%) |
| 1 | 1157151 | |
| 2 | 898438 | |
| 3 | 773609 | |
| 0 | 742907 | |
| 4 | 724395 | |
| 5 | 695725 | |
| 6 | 673176 | |
| 7 | 636233 | |
| 8 | 610720 | |
| 9 | 594451 | |
| Other values (37) | 1051935 |
Greek
| Value | Count | Frequency (%) |
| Σ | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10170472 | |
| None | 688 | < 0.1% |
| Number Forms | 3 | < 0.1% |
| Punctuation | 3 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 1157151 | |
| 2 | 898438 | 8.8% |
| 3 | 773609 | 7.6% |
| 0 | 742907 | 7.3% |
| 4 | 724395 | 7.1% |
| 5 | 695725 | 6.8% |
| 6 | 673176 | 6.6% |
| 7 | 636233 | 6.3% |
| 8 | 610720 | 6.0% |
| 9 | 594451 | 5.8% |
| Other values (78) | 2663667 |
None
| Value | Count | Frequency (%) |
| ½ | 622 | |
| è | 13 | 1.9% |
| ² | 10 | 1.5% |
| ¼ | 9 | 1.3% |
| é | 5 | 0.7% |
| á | 4 | 0.6% |
| ¾ | 4 | 0.6% |
| ³ | 3 | 0.4% |
| ó | 3 | 0.4% |
| ¢ | 3 | 0.4% |
| Other values (10) | 12 | 1.7% |
Number Forms
| Value | Count | Frequency (%) |
| ⅓ | 3 |
Punctuation
| Value | Count | Frequency (%) |
| – | 1 | |
| … | 1 | |
| › | 1 |
recordedBy
Text
Missing 
| Distinct | 146306 |
|---|---|
| Distinct (%) | 4.9% |
| Missing | 806049 |
| Missing (%) | 21.1% |
| Memory size | 29.1 MiB |
Length
| Max length | 54675 |
|---|---|
| Median length | 182 |
| Mean length | 17.2249567 |
| Min length | 1 |
Unique
| Unique | 69904 ? |
|---|---|
| Unique (%) | 2.3% |
Sample
| 1st row | G. Hendler |
|---|---|
| 2nd row | R. C. Rollins & D. Rollins |
| 3rd row | T. Vaughan |
| 4th row | D. Harper |
| 5th row | F. Harvey |
| Value | Count | Frequency (%) |
| 667749 | 6.3% | |
| j | 491479 | 4.7% |
| a | 393168 | 3.7% |
| r | 369249 | 3.5% |
| e | 349861 | 3.3% |
| c | 335018 | 3.2% |
| m | 318954 | 3.0% |
| h | 289384 | 2.7% |
| w | 252512 | 2.4% |
| l | 232486 | 2.2% |
| Other values (54376) | 6868027 |
Most occurring characters
| Value | Count | Frequency (%) |
| 7557395 | 14.6% | |
| . | 4811467 | 9.3% |
| e | 3584881 | 6.9% |
| a | 2593680 | 5.0% |
| r | 2503087 | 4.8% |
| n | 2366124 | 4.6% |
| o | 2354656 | 4.5% |
| i | 2159658 | 4.2% |
| l | 1869857 | 3.6% |
| t | 1864639 | 3.6% |
| Other values (148) | 20148087 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 27912312 | |
| Uppercase Letter | 10166557 | 19.6% |
| Space Separator | 7557395 | 14.6% |
| Other Punctuation | 5908704 | 11.4% |
| Dash Punctuation | 168034 | 0.3% |
| Close Punctuation | 35074 | 0.1% |
| Open Punctuation | 35044 | 0.1% |
| Decimal Number | 17237 | < 0.1% |
| Control | 13056 | < 0.1% |
| Math Symbol | 93 | < 0.1% |
| Other values (5) | 25 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 3584881 | |
| a | 2593680 | |
| r | 2503087 | |
| n | 2366124 | 8.5% |
| o | 2354656 | 8.4% |
| i | 2159658 | 7.7% |
| l | 1869857 | 6.7% |
| t | 1864639 | 6.7% |
| s | 1665441 | 6.0% |
| h | 836236 | 3.0% |
| Other values (66) | 6114053 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 931299 | 9.2% |
| S | 892293 | 8.8% |
| C | 776035 | 7.6% |
| R | 639220 | 6.3% |
| H | 636722 | 6.3% |
| B | 616217 | 6.1% |
| J | 590544 | 5.8% |
| A | 579459 | 5.7% |
| L | 545623 | 5.4% |
| W | 487694 | 4.8% |
| Other values (34) | 3471451 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 4811467 | |
| & | 594583 | 10.1% |
| , | 394712 | 6.7% |
| / | 99770 | 1.7% |
| ' | 6577 | 0.1% |
| : | 762 | < 0.1% |
| " | 711 | < 0.1% |
| ? | 81 | < 0.1% |
| ; | 32 | < 0.1% |
| # | 6 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 3448 | |
| 9 | 2478 | |
| 8 | 2316 | |
| 0 | 2065 | |
| 2 | 1533 | |
| 3 | 1425 | |
| 4 | 1320 | 7.7% |
| 5 | 1138 | 6.6% |
| 6 | 815 | 4.7% |
| 7 | 699 | 4.1% |
Control
| Value | Count | Frequency (%) |
| 12986 | ||
| 68 | 0.5% | |
| | 1 | < 0.1% |
| | 1 | < 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 27046 | |
| ( | 7998 | 22.8% |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 27044 | |
| ) | 8030 | 22.9% |
Math Symbol
| Value | Count | Frequency (%) |
| = | 84 | |
| + | 9 | 9.7% |
Space Separator
| Value | Count | Frequency (%) |
| 7557395 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 168034 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ´ | 16 |
Other Symbol
| Value | Count | Frequency (%) |
| ° | 6 |
Initial Punctuation
| Value | Count | Frequency (%) |
| « | 1 |
Final Punctuation
| Value | Count | Frequency (%) |
| » | 1 |
Currency Symbol
| Value | Count | Frequency (%) |
| ¢ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 38078867 | |
| Common | 13734662 | 26.5% |
| Greek | 1 | < 0.1% |
| Cyrillic | 1 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 3584881 | 9.4% |
| a | 2593680 | 6.8% |
| r | 2503087 | 6.6% |
| n | 2366124 | 6.2% |
| o | 2354656 | 6.2% |
| i | 2159658 | 5.7% |
| l | 1869857 | 4.9% |
| t | 1864639 | 4.9% |
| s | 1665441 | 4.4% |
| M | 931299 | 2.4% |
| Other values (108) | 16185545 |
Common
| Value | Count | Frequency (%) |
| 7557395 | ||
| . | 4811467 | |
| & | 594583 | 4.3% |
| , | 394712 | 2.9% |
| - | 168034 | 1.2% |
| / | 99770 | 0.7% |
| [ | 27046 | 0.2% |
| ] | 27044 | 0.2% |
| 12986 | 0.1% | |
| ) | 8030 | 0.1% |
| Other values (28) | 33595 | 0.2% |
Greek
| Value | Count | Frequency (%) |
| β | 1 |
Cyrillic
| Value | Count | Frequency (%) |
| Ӧ | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 51710194 | |
| None | 103334 | 0.2% |
| IPA Ext | 2 | < 0.1% |
| Cyrillic | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 7557395 | 14.6% | |
| . | 4811467 | 9.3% |
| e | 3584881 | 6.9% |
| a | 2593680 | 5.0% |
| r | 2503087 | 4.8% |
| n | 2366124 | 4.6% |
| o | 2354656 | 4.6% |
| i | 2159658 | 4.2% |
| l | 1869857 | 3.6% |
| t | 1864639 | 3.6% |
| Other values (72) | 20044750 |
None
| Value | Count | Frequency (%) |
| á | 17609 | |
| é | 17464 | |
| ó | 16022 | |
| í | 11942 | |
| ñ | 10358 | |
| è | 7115 | |
| ü | 5658 | 5.5% |
| ö | 4415 | 4.3% |
| ê | 2853 | 2.8% |
| ç | 1312 | 1.3% |
| Other values (64) | 8586 |
IPA Ext
| Value | Count | Frequency (%) |
| ɶ | 2 |
Cyrillic
| Value | Count | Frequency (%) |
| Ӧ | 1 |
individualCount
Text
| Distinct | 968 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1634 |
| Missing (%) | < 0.1% |
| Memory size | 29.1 MiB |
Length
| Max length | 5 |
|---|---|
| Median length | 1 |
| Mean length | 1.031925014 |
| Min length | 1 |
Unique
| Unique | 362 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 31 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 4 |
| 5th row | 1 |
| Value | Count | Frequency (%) |
| 1 | 3306583 | |
| 2 | 152239 | 4.0% |
| 3 | 74520 | 2.0% |
| 4 | 53383 | 1.4% |
| 5 | 38629 | 1.0% |
| 6 | 27032 | 0.7% |
| 10 | 19695 | 0.5% |
| 7 | 17153 | 0.4% |
| 8 | 15778 | 0.4% |
| 9 | 10348 | 0.3% |
| Other values (958) | 97105 | 2.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 3383057 | |
| 2 | 185717 | 4.7% |
| 3 | 91639 | 2.3% |
| 4 | 66277 | 1.7% |
| 5 | 59210 | 1.5% |
| 0 | 50050 | 1.3% |
| 6 | 35555 | 0.9% |
| 7 | 24584 | 0.6% |
| 8 | 22275 | 0.6% |
| 9 | 15814 | 0.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3934178 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 3383057 | |
| 2 | 185717 | 4.7% |
| 3 | 91639 | 2.3% |
| 4 | 66277 | 1.7% |
| 5 | 59210 | 1.5% |
| 0 | 50050 | 1.3% |
| 6 | 35555 | 0.9% |
| 7 | 24584 | 0.6% |
| 8 | 22275 | 0.6% |
| 9 | 15814 | 0.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3934178 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 3383057 | |
| 2 | 185717 | 4.7% |
| 3 | 91639 | 2.3% |
| 4 | 66277 | 1.7% |
| 5 | 59210 | 1.5% |
| 0 | 50050 | 1.3% |
| 6 | 35555 | 0.9% |
| 7 | 24584 | 0.6% |
| 8 | 22275 | 0.6% |
| 9 | 15814 | 0.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3934178 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 3383057 | |
| 2 | 185717 | 4.7% |
| 3 | 91639 | 2.3% |
| 4 | 66277 | 1.7% |
| 5 | 59210 | 1.5% |
| 0 | 50050 | 1.3% |
| 6 | 35555 | 0.9% |
| 7 | 24584 | 0.6% |
| 8 | 22275 | 0.6% |
| 9 | 15814 | 0.4% |
sex
Text
Missing 
| Distinct | 266 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 3118857 |
| Missing (%) | 81.8% |
| Memory size | 29.1 MiB |
Length
| Max length | 76 |
|---|---|
| Median length | 75 |
| Mean length | 5.596934593 |
| Min length | 1 |
Unique
| Unique | 110 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | unknown |
|---|---|
| 2nd row | Female |
| 3rd row | Male |
| 4th row | Male |
| 5th row | Male |
| Value | Count | Frequency (%) |
| male | 343203 | |
| female | 285482 | |
| unknown | 98697 | 13.5% |
| worker | 2922 | 0.4% |
| sex | 1719 | 0.2% |
| 731 | 0.1% | |
| hermaphrodite | 126 | < 0.1% |
| multiple | 119 | < 0.1% |
| animals | 119 | < 0.1% |
| of | 119 | < 0.1% |
| Other values (15) | 560 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 919476 | |
| a | 628885 | |
| l | 628877 | |
| m | 340648 | 8.8% |
| n | 296493 | 7.6% |
| M | 288718 | 7.4% |
| F | 227282 | 5.8% |
| o | 101977 | 2.6% |
| k | 101620 | 2.6% |
| w | 98773 | 2.5% |
| Other values (26) | 258475 | 6.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3222263 | |
| Uppercase Letter | 584061 | 15.0% |
| Other Punctuation | 46345 | 1.2% |
| Space Separator | 38555 | 1.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 919476 | |
| a | 628885 | |
| l | 628877 | |
| m | 340648 | 10.6% |
| n | 296493 | 9.2% |
| o | 101977 | 3.2% |
| k | 101620 | 3.2% |
| w | 98773 | 3.1% |
| f | 58364 | 1.8% |
| u | 36551 | 1.1% |
| Other values (10) | 10599 | 0.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 288718 | |
| F | 227282 | |
| U | 62378 | 10.7% |
| W | 2848 | 0.5% |
| S | 1600 | 0.3% |
| E | 508 | 0.1% |
| L | 362 | 0.1% |
| A | 362 | 0.1% |
| I | 2 | < 0.1% |
| P | 1 | < 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 45763 | |
| & | 490 | 1.1% |
| ? | 48 | 0.1% |
| / | 42 | 0.1% |
| , | 2 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 38555 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3806324 | |
| Common | 84900 | 2.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 919476 | |
| a | 628885 | |
| l | 628877 | |
| m | 340648 | 8.9% |
| n | 296493 | 7.8% |
| M | 288718 | 7.6% |
| F | 227282 | 6.0% |
| o | 101977 | 2.7% |
| k | 101620 | 2.7% |
| w | 98773 | 2.6% |
| Other values (20) | 173575 | 4.6% |
Common
| Value | Count | Frequency (%) |
| ; | 45763 | |
| 38555 | ||
| & | 490 | 0.6% |
| ? | 48 | 0.1% |
| / | 42 | < 0.1% |
| , | 2 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3891224 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 919476 | |
| a | 628885 | |
| l | 628877 | |
| m | 340648 | 8.8% |
| n | 296493 | 7.6% |
| M | 288718 | 7.4% |
| F | 227282 | 5.8% |
| o | 101977 | 2.6% |
| k | 101620 | 2.6% |
| w | 98773 | 2.5% |
| Other values (26) | 258475 | 6.6% |
lifeStage
Text
Missing 
| Distinct | 1019 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 3359653 |
| Missing (%) | 88.1% |
| Memory size | 29.1 MiB |
Length
| Max length | 80 |
|---|---|
| Median length | 5 |
| Mean length | 7.432049132 |
| Min length | 1 |
Unique
| Unique | 393 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Adult |
|---|---|
| 2nd row | Adult |
| 3rd row | Adult |
| 4th row | Fruiting |
| 5th row | phyllosoma VII |
| Value | Count | Frequency (%) |
| adult | 229330 | |
| flowering | 95850 | |
| fruiting | 41812 | 8.0% |
| juvenile | 34834 | 6.7% |
| and | 17801 | 3.4% |
| immature | 16677 | 3.2% |
| vegetative | 9757 | 1.9% |
| fertile | 7533 | 1.4% |
| 7209 | 1.4% | |
| ovigerous | 6452 | 1.2% |
| Other values (344) | 52858 | 10.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| l | 383349 | |
| u | 343495 | |
| t | 328549 | |
| e | 256052 | 7.6% |
| d | 254357 | 7.5% |
| i | 250388 | 7.4% |
| A | 209190 | 6.2% |
| n | 199864 | 5.9% |
| r | 192680 | 5.7% |
| g | 159378 | 4.7% |
| Other values (63) | 800163 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2841564 | |
| Uppercase Letter | 436961 | 12.9% |
| Space Separator | 65667 | 1.9% |
| Other Punctuation | 32873 | 1.0% |
| Dash Punctuation | 157 | < 0.1% |
| Decimal Number | 139 | < 0.1% |
| Open Punctuation | 52 | < 0.1% |
| Close Punctuation | 52 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| l | 383349 | |
| u | 343495 | |
| t | 328549 | |
| e | 256052 | |
| d | 254357 | |
| i | 250388 | |
| n | 199864 | |
| r | 192680 | |
| g | 159378 | |
| o | 120863 | 4.3% |
| Other values (17) | 352589 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 209190 | |
| F | 147747 | |
| I | 33460 | 7.7% |
| J | 17075 | 3.9% |
| V | 9929 | 2.3% |
| L | 5147 | 1.2% |
| S | 3481 | 0.8% |
| W | 1839 | 0.4% |
| E | 1600 | 0.4% |
| C | 1592 | 0.4% |
| Other values (15) | 5901 | 1.4% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 58 | |
| 2 | 39 | |
| 3 | 17 | 12.2% |
| 4 | 14 | 10.1% |
| 5 | 8 | 5.8% |
| 8 | 1 | 0.7% |
| 9 | 1 | 0.7% |
| 6 | 1 | 0.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 32549 | |
| ? | 175 | 0.5% |
| & | 81 | 0.2% |
| / | 28 | 0.1% |
| , | 19 | 0.1% |
| . | 12 | < 0.1% |
| ' | 9 | < 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 45 | |
| [ | 7 | 13.5% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 45 | |
| ] | 7 | 13.5% |
Space Separator
| Value | Count | Frequency (%) |
| 65667 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 157 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3278525 | |
| Common | 98940 | 2.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| l | 383349 | |
| u | 343495 | |
| t | 328549 | |
| e | 256052 | 7.8% |
| d | 254357 | 7.8% |
| i | 250388 | 7.6% |
| A | 209190 | 6.4% |
| n | 199864 | 6.1% |
| r | 192680 | 5.9% |
| g | 159378 | 4.9% |
| Other values (42) | 701223 |
Common
| Value | Count | Frequency (%) |
| 65667 | ||
| ; | 32549 | |
| ? | 175 | 0.2% |
| - | 157 | 0.2% |
| & | 81 | 0.1% |
| 1 | 58 | 0.1% |
| ( | 45 | < 0.1% |
| ) | 45 | < 0.1% |
| 2 | 39 | < 0.1% |
| / | 28 | < 0.1% |
| Other values (11) | 96 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3377452 | |
| None | 13 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| l | 383349 | |
| u | 343495 | |
| t | 328549 | |
| e | 256052 | 7.6% |
| d | 254357 | 7.5% |
| i | 250388 | 7.4% |
| A | 209190 | 6.2% |
| n | 199864 | 5.9% |
| r | 192680 | 5.7% |
| g | 159378 | 4.7% |
| Other values (61) | 800150 |
None
| Value | Count | Frequency (%) |
| ü | 9 | |
| í | 4 |
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 3814098 |
| Missing (%) | > 99.9% |
| Memory size | 29.1 MiB |
Length
| Max length | 58 |
|---|---|
| Median length | 58 |
| Mean length | 58 |
| Min length | 58 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Animalia, Chordata, Vertebrata, Amphibia, Anura, Bufonidae |
|---|
| Value | Count | Frequency (%) |
| animalia | 1 | |
| chordata | 1 | |
| vertebrata | 1 | |
| amphibia | 1 | |
| anura | 1 | |
| bufonidae | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 9 | |
| i | 5 | 8.6% |
| , | 5 | 8.6% |
| 5 | 8.6% | |
| r | 4 | 6.9% |
| A | 3 | 5.2% |
| e | 3 | 5.2% |
| t | 3 | 5.2% |
| n | 3 | 5.2% |
| d | 2 | 3.4% |
| Other values (11) | 16 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 42 | |
| Uppercase Letter | 6 | 10.3% |
| Other Punctuation | 5 | 8.6% |
| Space Separator | 5 | 8.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 9 | |
| i | 5 | |
| r | 4 | |
| e | 3 | 7.1% |
| t | 3 | 7.1% |
| n | 3 | 7.1% |
| d | 2 | 4.8% |
| u | 2 | 4.8% |
| b | 2 | 4.8% |
| o | 2 | 4.8% |
| Other values (5) | 7 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 3 | |
| V | 1 | 16.7% |
| C | 1 | 16.7% |
| B | 1 | 16.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 5 |
Space Separator
| Value | Count | Frequency (%) |
| 5 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 48 | |
| Common | 10 | 17.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 9 | |
| i | 5 | |
| r | 4 | 8.3% |
| A | 3 | 6.2% |
| e | 3 | 6.2% |
| t | 3 | 6.2% |
| n | 3 | 6.2% |
| d | 2 | 4.2% |
| u | 2 | 4.2% |
| b | 2 | 4.2% |
| Other values (9) | 12 |
Common
| Value | Count | Frequency (%) |
| , | 5 | |
| 5 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 58 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 9 | |
| i | 5 | 8.6% |
| , | 5 | 8.6% |
| 5 | 8.6% | |
| r | 4 | 6.9% |
| A | 3 | 5.2% |
| e | 3 | 5.2% |
| t | 3 | 5.2% |
| n | 3 | 5.2% |
| d | 2 | 3.4% |
| Other values (11) | 16 |
caste
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 3814098 |
| Missing (%) | > 99.9% |
| Memory size | 29.1 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 8 |
| Min length | 8 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Animalia |
|---|
| Value | Count | Frequency (%) |
| animalia | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 2 | |
| a | 2 | |
| A | 1 | |
| n | 1 | |
| m | 1 | |
| l | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7 | |
| Uppercase Letter | 1 | 12.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 2 | |
| a | 2 | |
| n | 1 | |
| m | 1 | |
| l | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 2 | |
| a | 2 | |
| A | 1 | |
| n | 1 | |
| m | 1 | |
| l | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 2 | |
| a | 2 | |
| A | 1 | |
| n | 1 | |
| m | 1 | |
| l | 1 |
behavior
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 3814098 |
| Missing (%) | > 99.9% |
| Memory size | 29.1 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 8 |
| Min length | 8 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Chordata |
|---|
| Value | Count | Frequency (%) |
| chordata | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 2 | |
| C | 1 | |
| h | 1 | |
| o | 1 | |
| r | 1 | |
| d | 1 | |
| t | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7 | |
| Uppercase Letter | 1 | 12.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 2 | |
| h | 1 | |
| o | 1 | |
| r | 1 | |
| d | 1 | |
| t | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 2 | |
| C | 1 | |
| h | 1 | |
| o | 1 | |
| r | 1 | |
| d | 1 | |
| t | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 2 | |
| C | 1 | |
| h | 1 | |
| o | 1 | |
| r | 1 | |
| d | 1 | |
| t | 1 |
vitality
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 3814098 |
| Missing (%) | > 99.9% |
| Memory size | 29.1 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 8 |
| Min length | 8 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Amphibia |
|---|
| Value | Count | Frequency (%) |
| amphibia | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 2 | |
| A | 1 | |
| m | 1 | |
| p | 1 | |
| h | 1 | |
| b | 1 | |
| a | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7 | |
| Uppercase Letter | 1 | 12.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 2 | |
| m | 1 | |
| p | 1 | |
| h | 1 | |
| b | 1 | |
| a | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 2 | |
| A | 1 | |
| m | 1 | |
| p | 1 | |
| h | 1 | |
| b | 1 | |
| a | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 2 | |
| A | 1 | |
| m | 1 | |
| p | 1 | |
| h | 1 | |
| b | 1 | |
| a | 1 |
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 3814098 |
| Missing (%) | > 99.9% |
| Memory size | 29.1 MiB |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 5 |
| Min length | 5 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Anura |
|---|
| Value | Count | Frequency (%) |
| anura | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 1 | |
| n | 1 | |
| u | 1 | |
| r | 1 | |
| a | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4 | |
| Uppercase Letter | 1 | 20.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 1 | |
| u | 1 | |
| r | 1 | |
| a | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 5 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 1 | |
| n | 1 | |
| u | 1 | |
| r | 1 | |
| a | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 1 | |
| n | 1 | |
| u | 1 | |
| r | 1 | |
| a | 1 |
pathway
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 3814098 |
| Missing (%) | > 99.9% |
| Memory size | 29.1 MiB |
Length
| Max length | 9 |
|---|---|
| Median length | 9 |
| Mean length | 9 |
| Min length | 9 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Bufonidae |
|---|
| Value | Count | Frequency (%) |
| bufonidae | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| B | 1 | |
| u | 1 | |
| f | 1 | |
| o | 1 | |
| n | 1 | |
| i | 1 | |
| d | 1 | |
| a | 1 | |
| e | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 8 | |
| Uppercase Letter | 1 | 11.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| u | 1 | |
| f | 1 | |
| o | 1 | |
| n | 1 | |
| i | 1 | |
| d | 1 | |
| a | 1 | |
| e | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| B | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 9 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| B | 1 | |
| u | 1 | |
| f | 1 | |
| o | 1 | |
| n | 1 | |
| i | 1 | |
| d | 1 | |
| a | 1 | |
| e | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| B | 1 | |
| u | 1 | |
| f | 1 | |
| o | 1 | |
| n | 1 | |
| i | 1 | |
| d | 1 | |
| a | 1 | |
| e | 1 |
preparations
Text
Missing 
| Distinct | 1356 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 1975286 |
| Missing (%) | 51.8% |
| Memory size | 29.1 MiB |
Length
| Max length | 192 |
|---|---|
| Median length | 157 |
| Mean length | 9.648374794 |
| Min length | 1 |
Unique
| Unique | 545 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Alcohol (Ethanol) |
|---|---|
| 2nd row | Ethanol |
| 3rd row | Dry |
| 4th row | Alcohol (Ethanol) |
| 5th row | Pinned |
| Value | Count | Frequency (%) |
| ethanol | 603904 | |
| dry | 379221 | |
| alcohol | 369929 | |
| skin | 344604 | |
| whole | 220441 | 8.0% |
| skull | 185981 | 6.7% |
| pinned | 160804 | 5.8% |
| slide | 80393 | 2.9% |
| fluid | 55206 | 2.0% |
| envelope | 47332 | 1.7% |
| Other values (251) | 322339 |
Most occurring characters
| Value | Count | Frequency (%) |
| l | 2282490 | 12.9% |
| o | 1831241 | 10.3% |
| n | 1442452 | 8.1% |
| h | 1247138 | 7.0% |
| 931341 | 5.2% | |
| i | 795533 | 4.5% |
| e | 770760 | 4.3% |
| a | 765971 | 4.3% |
| t | 743634 | 4.2% |
| S | 684993 | 3.9% |
| Other values (64) | 6246004 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 12811436 | |
| Uppercase Letter | 2720642 | 15.3% |
| Space Separator | 931341 | 5.2% |
| Other Punctuation | 461552 | 2.6% |
| Open Punctuation | 396222 | 2.2% |
| Close Punctuation | 396222 | 2.2% |
| Decimal Number | 16200 | 0.1% |
| Dash Punctuation | 7942 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| l | 2282490 | |
| o | 1831241 | |
| n | 1442452 | |
| h | 1247138 | |
| i | 795533 | 6.2% |
| e | 770760 | 6.0% |
| a | 765971 | 6.0% |
| t | 743634 | 5.8% |
| k | 580419 | 4.5% |
| r | 510603 | 4.0% |
| Other values (16) | 1841195 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 684993 | |
| E | 669745 | |
| D | 380446 | |
| A | 375707 | |
| W | 238393 | 8.8% |
| P | 194234 | 7.1% |
| F | 72380 | 2.7% |
| M | 25443 | 0.9% |
| B | 12825 | 0.5% |
| L | 10936 | 0.4% |
| Other values (15) | 55540 | 2.0% |
Decimal Number
| Value | Count | Frequency (%) |
| 9 | 7489 | |
| 5 | 7357 | |
| 0 | 743 | 4.6% |
| 8 | 412 | 2.5% |
| 7 | 165 | 1.0% |
| 1 | 16 | 0.1% |
| 2 | 15 | 0.1% |
| 3 | 2 | < 0.1% |
| 6 | 1 | < 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 231665 | |
| ; | 218925 | |
| % | 8082 | 1.8% |
| & | 1368 | 0.3% |
| / | 1345 | 0.3% |
| . | 105 | < 0.1% |
| , | 59 | < 0.1% |
| ? | 3 | < 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 394620 | |
| [ | 1602 | 0.4% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 394620 | |
| ] | 1602 | 0.4% |
Space Separator
| Value | Count | Frequency (%) |
| 931341 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 7942 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 15532078 | |
| Common | 2209479 | 12.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| l | 2282490 | |
| o | 1831241 | |
| n | 1442452 | 9.3% |
| h | 1247138 | 8.0% |
| i | 795533 | 5.1% |
| e | 770760 | 5.0% |
| a | 765971 | 4.9% |
| t | 743634 | 4.8% |
| S | 684993 | 4.4% |
| E | 669745 | 4.3% |
| Other values (41) | 4298121 |
Common
| Value | Count | Frequency (%) |
| 931341 | ||
| ( | 394620 | |
| ) | 394620 | |
| : | 231665 | 10.5% |
| ; | 218925 | 9.9% |
| % | 8082 | 0.4% |
| - | 7942 | 0.4% |
| 9 | 7489 | 0.3% |
| 5 | 7357 | 0.3% |
| ] | 1602 | 0.1% |
| Other values (13) | 5836 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 17741557 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| l | 2282490 | 12.9% |
| o | 1831241 | 10.3% |
| n | 1442452 | 8.1% |
| h | 1247138 | 7.0% |
| 931341 | 5.2% | |
| i | 795533 | 4.5% |
| e | 770760 | 4.3% |
| a | 765971 | 4.3% |
| t | 743634 | 4.2% |
| S | 684993 | 3.9% |
| Other values (64) | 6246004 |
disposition
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 3814098 |
| Missing (%) | > 99.9% |
| Memory size | 29.1 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 8 |
| Min length | 8 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Rhinella |
|---|
| Value | Count | Frequency (%) |
| rhinella | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| l | 2 | |
| R | 1 | |
| h | 1 | |
| i | 1 | |
| n | 1 | |
| e | 1 | |
| a | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7 | |
| Uppercase Letter | 1 | 12.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| l | 2 | |
| h | 1 | |
| i | 1 | |
| n | 1 | |
| e | 1 | |
| a | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| l | 2 | |
| R | 1 | |
| h | 1 | |
| i | 1 | |
| n | 1 | |
| e | 1 | |
| a | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| l | 2 | |
| R | 1 | |
| h | 1 | |
| i | 1 | |
| n | 1 | |
| e | 1 | |
| a | 1 |
associatedMedia
Text
Missing 
| Distinct | 2007097 |
|---|---|
| Distinct (%) | 83.0% |
| Missing | 1396847 |
| Missing (%) | 36.6% |
| Memory size | 29.1 MiB |
Length
| Max length | 1040 |
|---|---|
| Median length | 49 |
| Mean length | 50.09518329 |
| Min length | 42 |
Unique
| Unique | 1946069 ? |
|---|---|
| Unique (%) | 80.5% |
Sample
| 1st row | https://collections.nmnh.si.edu/media/?i=14071815 |
|---|---|
| 2nd row | https://collections.nmnh.si.edu/media/?i=15812604 |
| 3rd row | https://collections.nmnh.si.edu/media/?i=16381603 |
| 4th row | https://collections.nmnh.si.edu/media/?i=15690882 |
| 5th row | https://collections.nmnh.si.edu/media/?i=14020520 |
| Value | Count | Frequency (%) |
| 14558510 | 1287 | < 0.1% |
| 14894714 | 1283 | < 0.1% |
| 14888503 | 1224 | < 0.1% |
| 14888504 | 881 | < 0.1% |
| 5000376 | 839 | < 0.1% |
| 5000375 | 839 | < 0.1% |
| https://collections.nmnh.si.edu/media/?i=10674432 | 657 | < 0.1% |
| 15777181 | 615 | < 0.1% |
| https://collections.nmnh.si.edu/media/?i=10689696 | 591 | < 0.1% |
| 15596573 | 565 | < 0.1% |
| Other values (2238001) | 2687369 |
Most occurring characters
| Value | Count | Frequency (%) |
| / | 9669008 | 8.0% |
| i | 9669008 | 8.0% |
| s | 7251756 | 6.0% |
| e | 7251756 | 6.0% |
| n | 7251756 | 6.0% |
| . | 7251756 | 6.0% |
| t | 7251756 | 6.0% |
| h | 4834504 | 4.0% |
| c | 4834504 | 4.0% |
| o | 4834504 | 4.0% |
| Other values (21) | 50992374 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 74934812 | |
| Other Punctuation | 22034168 | 18.2% |
| Decimal Number | 21427552 | 17.7% |
| Math Symbol | 2417252 | 2.0% |
| Space Separator | 278898 | 0.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 9669008 | |
| s | 7251756 | |
| e | 7251756 | |
| n | 7251756 | |
| t | 7251756 | |
| h | 4834504 | 6.5% |
| c | 4834504 | 6.5% |
| o | 4834504 | 6.5% |
| l | 4834504 | 6.5% |
| m | 4834504 | 6.5% |
| Other values (4) | 12086260 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 4679824 | |
| 5 | 2199813 | |
| 4 | 2170502 | |
| 3 | 1974185 | |
| 2 | 1952018 | |
| 0 | 1830529 | 8.5% |
| 6 | 1794059 | 8.4% |
| 8 | 1719074 | 8.0% |
| 7 | 1561962 | 7.3% |
| 9 | 1545586 | 7.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 9669008 | |
| . | 7251756 | |
| ? | 2417252 | 11.0% |
| : | 2417252 | 11.0% |
| ; | 278900 | 1.3% |
Math Symbol
| Value | Count | Frequency (%) |
| = | 2417252 |
Space Separator
| Value | Count | Frequency (%) |
| 278898 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 74934812 | |
| Common | 46157870 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| / | 9669008 | |
| . | 7251756 | |
| 1 | 4679824 | |
| ? | 2417252 | 5.2% |
| = | 2417252 | 5.2% |
| : | 2417252 | 5.2% |
| 5 | 2199813 | 4.8% |
| 4 | 2170502 | 4.7% |
| 3 | 1974185 | 4.3% |
| 2 | 1952018 | 4.2% |
| Other values (7) | 9009008 |
Latin
| Value | Count | Frequency (%) |
| i | 9669008 | |
| s | 7251756 | |
| e | 7251756 | |
| n | 7251756 | |
| t | 7251756 | |
| h | 4834504 | 6.5% |
| c | 4834504 | 6.5% |
| o | 4834504 | 6.5% |
| l | 4834504 | 6.5% |
| m | 4834504 | 6.5% |
| Other values (4) | 12086260 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 121092682 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| / | 9669008 | 8.0% |
| i | 9669008 | 8.0% |
| s | 7251756 | 6.0% |
| e | 7251756 | 6.0% |
| n | 7251756 | 6.0% |
| . | 7251756 | 6.0% |
| t | 7251756 | 6.0% |
| h | 4834504 | 4.0% |
| c | 4834504 | 4.0% |
| o | 4834504 | 4.0% |
| Other values (21) | 50992374 |
Missing 
| Distinct | 5043 |
|---|---|
| Distinct (%) | 99.4% |
| Missing | 3809026 |
| Missing (%) | 99.9% |
| Memory size | 29.1 MiB |
Length
| Max length | 12558 |
|---|---|
| Median length | 49 |
| Mean length | 104.1133452 |
| Min length | 21 |
Unique
| Unique | 5032 ? |
|---|---|
| Unique (%) | 99.2% |
Sample
| 1st row | https://www.ncbi.nlm.nih.gov/gquery?term=KM080038 |
|---|---|
| 2nd row | https://www.ncbi.nlm.nih.gov/gquery?term=EU823242|https://www.ncbi.nlm.nih.gov/gquery?term=EU823167|https://www.ncbi.nlm.nih.gov/gquery?term=KC246618 |
| 3rd row | https://www.ncbi.nlm.nih.gov/gquery?term=MN549733 |
| 4th row | https://www.ncbi.nlm.nih.gov/gquery?term=KC771789|https://www.ncbi.nlm.nih.gov/gquery?term=KC771632 |
| 5th row | https://www.ncbi.nlm.nih.gov/gquery?term=HQ600894 |
| Value | Count | Frequency (%) |
| https://www.ncbi.nlm.nih.gov/gquery?term=prjna521985 | 12 | 0.2% |
| https://www.ncbi.nlm.nih.gov/gquery?term=km521547 | 8 | 0.2% |
| https://www.ncbi.nlm.nih.gov/gquery?term=ay273864 | 4 | 0.1% |
| https://www.ncbi.nlm.nih.gov/gquery?term=ay273835 | 3 | 0.1% |
| https://www.ncbi.nlm.nih.gov/gquery?term=ay273832 | 2 | < 0.1% |
| https://www.ncbi.nlm.nih.gov/gquery?term=fj207364 | 2 | < 0.1% |
| https://www.ncbi.nlm.nih.gov/gquery?term=kf989555|https://www.ncbi.nlm.nih.gov/gquery?term=kf989872|https://www.ncbi.nlm.nih.gov/gquery?term=kf989774|https://www.ncbi.nlm.nih.gov/gquery?term=kf989974|https://www.ncbi.nlm.nih.gov/gquery?term=kf989663 | 2 | < 0.1% |
| https://www.ncbi.nlm.nih.gov/gquery?term=mh244118 | 2 | < 0.1% |
| https://www.ncbi.nlm.nih.gov/gquery?term=kp739770 | 2 | < 0.1% |
| https://www.ncbi.nlm.nih.gov/gquery?term=jn837192|https://www.ncbi.nlm.nih.gov/gquery?term=jn837282|https://www.ncbi.nlm.nih.gov/gquery?term=jn837372|https://www.ncbi.nlm.nih.gov/gquery?term=jn837475 | 2 | < 0.1% |
| Other values (5034) | 5035 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 42512 | 8.0% |
| t | 31870 | 6.0% |
| / | 31869 | 6.0% |
| w | 31869 | 6.0% |
| n | 31869 | 6.0% |
| r | 21250 | 4.0% |
| i | 21248 | 4.0% |
| g | 21248 | 4.0% |
| e | 21247 | 4.0% |
| m | 21247 | 4.0% |
| Other values (57) | 251938 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 329331 | |
| Other Punctuation | 95629 | 18.1% |
| Decimal Number | 64584 | 12.2% |
| Uppercase Letter | 22264 | 4.2% |
| Math Symbol | 16174 | 3.1% |
| Dash Punctuation | 183 | < 0.1% |
| Space Separator | 1 | < 0.1% |
| Connector Punctuation | 1 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| K | 4071 | |
| M | 2718 | |
| J | 2341 | |
| U | 1725 | 7.7% |
| Q | 1614 | 7.2% |
| F | 1316 | 5.9% |
| E | 865 | 3.9% |
| R | 815 | 3.7% |
| T | 732 | 3.3% |
| W | 728 | 3.3% |
| Other values (16) | 5339 |
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 31870 | 9.7% |
| w | 31869 | 9.7% |
| n | 31869 | 9.7% |
| r | 21250 | 6.5% |
| i | 21248 | 6.5% |
| g | 21248 | 6.5% |
| e | 21247 | 6.5% |
| m | 21247 | 6.5% |
| h | 21246 | 6.5% |
| o | 10624 | 3.2% |
| Other values (11) | 95613 |
Decimal Number
| Value | Count | Frequency (%) |
| 7 | 7494 | |
| 2 | 7086 | |
| 4 | 6564 | |
| 8 | 6478 | |
| 1 | 6347 | |
| 9 | 6334 | |
| 6 | 6120 | |
| 3 | 6086 | |
| 0 | 6082 | |
| 5 | 5993 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 42512 | |
| / | 31869 | |
| : | 10623 | 11.1% |
| ? | 10623 | 11.1% |
| " | 2 | < 0.1% |
Math Symbol
| Value | Count | Frequency (%) |
| = | 10623 | |
| | | 5551 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 183 |
Space Separator
| Value | Count | Frequency (%) |
| 1 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 351595 | |
| Common | 176572 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 31870 | 9.1% |
| w | 31869 | 9.1% |
| n | 31869 | 9.1% |
| r | 21250 | 6.0% |
| i | 21248 | 6.0% |
| g | 21248 | 6.0% |
| e | 21247 | 6.0% |
| m | 21247 | 6.0% |
| h | 21246 | 6.0% |
| o | 10624 | 3.0% |
| Other values (37) | 117877 |
Common
| Value | Count | Frequency (%) |
| . | 42512 | |
| / | 31869 | |
| : | 10623 | 6.0% |
| ? | 10623 | 6.0% |
| = | 10623 | 6.0% |
| 7 | 7494 | 4.2% |
| 2 | 7086 | 4.0% |
| 4 | 6564 | 3.7% |
| 8 | 6478 | 3.7% |
| 1 | 6347 | 3.6% |
| Other values (10) | 36353 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 528167 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 42512 | 8.0% |
| t | 31870 | 6.0% |
| / | 31869 | 6.0% |
| w | 31869 | 6.0% |
| n | 31869 | 6.0% |
| r | 21250 | 4.0% |
| i | 21248 | 4.0% |
| g | 21248 | 4.0% |
| e | 21247 | 4.0% |
| m | 21247 | 4.0% |
| Other values (57) | 251938 |
Missing 
| Distinct | 253750 |
|---|---|
| Distinct (%) | 50.0% |
| Missing | 3306658 |
| Missing (%) | 86.7% |
| Memory size | 29.1 MiB |
Length
| Max length | 233869 |
|---|---|
| Median length | 3014 |
| Mean length | 66.88194293 |
| Min length | 1 |
Unique
| Unique | 219585 ? |
|---|---|
| Unique (%) | 43.3% |
Sample
| 1st row | Ninoe sp. B |
|---|---|
| 2nd row | {"hostGen":"Wallago","hostSpec":"after","hostBodyLoc":"stomach"}; Original USNPC preservative was a solution of 70% ethanol, 3% formalin, and 2% glycerine |
| 3rd row | {"hostGen":"Catoptrophorus","hostSpec":"semipalmatus","hostBodyLoc":"esophagus","hostFldNo":"JEBadley-426-23"}; Glycerin jelly |
| 4th row | Scripps Institution of Oceanography library archives about M.J. Johnson Phyllosoma Collection: specimens were stained with fast green and are mounted mostly in Canada balsam, Harleco synthetic resin or diatex. |
| 5th row | 8/28/28; 6527; Orcutt; Chamberlain Coll |
| Value | Count | Frequency (%) |
| of | 102829 | 2.1% |
| by | 78963 | 1.6% |
| and | 73475 | 1.5% |
| the | 71216 | 1.5% |
| coll | 62126 | 1.3% |
| 56730 | 1.2% | |
| a | 55849 | 1.1% |
| to | 50755 | 1.0% |
| was | 43759 | 0.9% |
| in | 42473 | 0.9% |
| Other values (208280) | 4265119 |
Most occurring characters
| Value | Count | Frequency (%) |
| 4357921 | 12.8% | |
| e | 2347617 | 6.9% |
| o | 1831790 | 5.4% |
| a | 1822117 | 5.4% |
| i | 1658183 | 4.9% |
| t | 1581641 | 4.7% |
| n | 1541699 | 4.5% |
| r | 1401352 | 4.1% |
| s | 1326121 | 3.9% |
| l | 1310982 | 3.9% |
| Other values (163) | 14759217 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 20844026 | |
| Space Separator | 4357921 | 12.8% |
| Uppercase Letter | 3347998 | 9.9% |
| Other Punctuation | 2602178 | 7.7% |
| Decimal Number | 2132975 | 6.3% |
| Control | 203815 | 0.6% |
| Dash Punctuation | 172638 | 0.5% |
| Open Punctuation | 123338 | 0.4% |
| Close Punctuation | 123285 | 0.4% |
| Math Symbol | 23926 | 0.1% |
| Other values (10) | 6540 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 2347617 | |
| o | 1831790 | 8.8% |
| a | 1822117 | 8.7% |
| i | 1658183 | 8.0% |
| t | 1581641 | 7.6% |
| n | 1541699 | 7.4% |
| r | 1401352 | 6.7% |
| s | 1326121 | 6.4% |
| l | 1310982 | 6.3% |
| d | 887403 | 4.3% |
| Other values (53) | 5135121 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 393212 | 11.7% |
| C | 350050 | 10.5% |
| P | 208690 | 6.2% |
| B | 182921 | 5.5% |
| N | 177358 | 5.3% |
| M | 176100 | 5.3% |
| F | 172695 | 5.2% |
| T | 155295 | 4.6% |
| A | 149808 | 4.5% |
| L | 143338 | 4.3% |
| Other values (27) | 1238531 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 768820 | |
| " | 514712 | |
| ; | 485790 | |
| , | 340235 | |
| : | 278626 | 10.7% |
| % | 69874 | 2.7% |
| / | 56498 | 2.2% |
| ! | 27039 | 1.0% |
| ' | 21725 | 0.8% |
| # | 18216 | 0.7% |
| Other values (9) | 20643 | 0.8% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 422009 | |
| 2 | 276489 | |
| 0 | 240230 | |
| 9 | 233861 | |
| 3 | 188131 | |
| 7 | 169939 | |
| 5 | 159142 | 7.5% |
| 6 | 155302 | 7.3% |
| 4 | 150818 | 7.1% |
| 8 | 137054 | 6.4% |
Math Symbol
| Value | Count | Frequency (%) |
| = | 12697 | |
| + | 5742 | |
| | | 5218 | |
| ~ | 112 | 0.5% |
| > | 91 | 0.4% |
| < | 43 | 0.2% |
| × | 19 | 0.1% |
| ± | 4 | < 0.1% |
Other Symbol
| Value | Count | Frequency (%) |
| ° | 1531 | |
| ♂ | 46 | 2.9% |
| ♀ | 21 | 1.3% |
| © | 9 | 0.6% |
| ⚥ | 5 | 0.3% |
Other Number
| Value | Count | Frequency (%) |
| ½ | 15 | |
| ¼ | 2 | 9.1% |
| ¹ | 2 | 9.1% |
| ¾ | 2 | 9.1% |
| ³ | 1 | 4.5% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 171764 | |
| – | 865 | 0.5% |
| — | 9 | < 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 82302 | |
| { | 36447 | |
| [ | 4589 | 3.7% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 82267 | |
| } | 36442 | |
| ] | 4576 | 3.7% |
Final Punctuation
| Value | Count | Frequency (%) |
| ” | 195 | |
| › | 5 | 2.5% |
| » | 1 | 0.5% |
Nonspacing Mark
| Value | Count | Frequency (%) |
| ́ | 138 | |
| ̀ | 46 | 20.0% |
| ̧ | 46 | 20.0% |
Control
| Value | Count | Frequency (%) |
| 202744 | ||
| 1071 | 0.5% |
Initial Punctuation
| Value | Count | Frequency (%) |
| “ | 185 | |
| « | 1 | 0.5% |
Modifier Symbol
| Value | Count | Frequency (%) |
| ^ | 5 | |
| ´ | 3 |
Space Separator
| Value | Count | Frequency (%) |
| 4357921 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 3618 |
Other Letter
| Value | Count | Frequency (%) |
| º | 485 |
Currency Symbol
| Value | Count | Frequency (%) |
| $ | 177 |
Format
| Value | Count | Frequency (%) |
| | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 24192452 | |
| Common | 9745914 | |
| Inherited | 230 | < 0.1% |
| Greek | 44 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 2347617 | 9.7% |
| o | 1831790 | 7.6% |
| a | 1822117 | 7.5% |
| i | 1658183 | 6.9% |
| t | 1581641 | 6.5% |
| n | 1541699 | 6.4% |
| r | 1401352 | 5.8% |
| s | 1326121 | 5.5% |
| l | 1310982 | 5.4% |
| d | 887403 | 3.7% |
| Other values (88) | 8483547 |
Common
| Value | Count | Frequency (%) |
| 4357921 | ||
| . | 768820 | 7.9% |
| " | 514712 | 5.3% |
| ; | 485790 | 5.0% |
| 1 | 422009 | 4.3% |
| , | 340235 | 3.5% |
| : | 278626 | 2.9% |
| 2 | 276489 | 2.8% |
| 0 | 240230 | 2.5% |
| 9 | 233861 | 2.4% |
| Other values (60) | 1827221 |
Inherited
| Value | Count | Frequency (%) |
| ́ | 138 | |
| ̀ | 46 | 20.0% |
| ̧ | 46 | 20.0% |
Greek
| Value | Count | Frequency (%) |
| μ | 43 | |
| π | 1 | 2.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 33933102 | |
| None | 3879 | < 0.1% |
| Punctuation | 1357 | < 0.1% |
| Diacriticals | 230 | < 0.1% |
| Misc Symbols | 72 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 4357921 | 12.8% | |
| e | 2347617 | 6.9% |
| o | 1831790 | 5.4% |
| a | 1822117 | 5.4% |
| i | 1658183 | 4.9% |
| t | 1581641 | 4.7% |
| n | 1541699 | 4.5% |
| r | 1401352 | 4.1% |
| s | 1326121 | 3.9% |
| l | 1310982 | 3.9% |
| Other values (86) | 14753679 |
None
| Value | Count | Frequency (%) |
| ° | 1531 | |
| º | 485 | 12.5% |
| é | 435 | 11.2% |
| í | 370 | 9.5% |
| ñ | 156 | 4.0% |
| á | 151 | 3.9% |
| · | 87 | 2.2% |
| ã | 75 | 1.9% |
| ü | 75 | 1.9% |
| ó | 74 | 1.9% |
| Other values (54) | 440 | 11.3% |
Punctuation
| Value | Count | Frequency (%) |
| – | 865 | |
| ” | 195 | 14.4% |
| “ | 185 | 13.6% |
| … | 74 | 5.5% |
| • | 24 | 1.8% |
| — | 9 | 0.7% |
| › | 5 | 0.4% |
Diacriticals
| Value | Count | Frequency (%) |
| ́ | 138 | |
| ̀ | 46 | 20.0% |
| ̧ | 46 | 20.0% |
Misc Symbols
| Value | Count | Frequency (%) |
| ♂ | 46 | |
| ♀ | 21 | |
| ⚥ | 5 | 6.9% |
organismName
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 3814097 |
| Missing (%) | > 99.9% |
| Memory size | 29.1 MiB |
Length
| Max length | 5 |
|---|---|
| Median length | 4.5 |
| Mean length | 4.5 |
| Min length | 4 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 69.0 |
|---|---|
| 2nd row | 720.0 |
| Value | Count | Frequency (%) |
| 69.0 | 1 | |
| 720.0 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 3 | |
| . | 2 | |
| 6 | 1 | 11.1% |
| 9 | 1 | 11.1% |
| 7 | 1 | 11.1% |
| 2 | 1 | 11.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 7 | |
| Other Punctuation | 2 | 22.2% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 3 | |
| 6 | 1 | 14.3% |
| 9 | 1 | 14.3% |
| 7 | 1 | 14.3% |
| 2 | 1 | 14.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 9 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 3 | |
| . | 2 | |
| 6 | 1 | 11.1% |
| 9 | 1 | 11.1% |
| 7 | 1 | 11.1% |
| 2 | 1 | 11.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 3 | |
| . | 2 | |
| 6 | 1 | 11.1% |
| 9 | 1 | 11.1% |
| 7 | 1 | 11.1% |
| 2 | 1 | 11.1% |
verbatimLabel
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 3814098 |
| Missing (%) | > 99.9% |
| Memory size | 29.1 MiB |
Length
| Max length | 45 |
|---|---|
| Median length | 45 |
| Mean length | 45 |
| Min length | 45 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | North America, Canada, Nunavut, Baffin Island |
|---|
| Value | Count | Frequency (%) |
| north | 1 | |
| america | 1 | |
| canada | 1 | |
| nunavut | 1 | |
| baffin | 1 | |
| island | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 7 | |
| 5 | 11.1% | |
| n | 4 | 8.9% |
| , | 3 | 6.7% |
| i | 2 | 4.4% |
| f | 2 | 4.4% |
| u | 2 | 4.4% |
| d | 2 | 4.4% |
| N | 2 | 4.4% |
| t | 2 | 4.4% |
| Other values (13) | 14 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 31 | |
| Uppercase Letter | 6 | 13.3% |
| Space Separator | 5 | 11.1% |
| Other Punctuation | 3 | 6.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 7 | |
| n | 4 | |
| i | 2 | 6.5% |
| f | 2 | 6.5% |
| u | 2 | 6.5% |
| d | 2 | 6.5% |
| t | 2 | 6.5% |
| r | 2 | 6.5% |
| e | 1 | 3.2% |
| c | 1 | 3.2% |
| Other values (6) | 6 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 2 | |
| C | 1 | |
| A | 1 | |
| B | 1 | |
| I | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 5 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 37 | |
| Common | 8 | 17.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 7 | |
| n | 4 | 10.8% |
| i | 2 | 5.4% |
| f | 2 | 5.4% |
| u | 2 | 5.4% |
| d | 2 | 5.4% |
| N | 2 | 5.4% |
| t | 2 | 5.4% |
| r | 2 | 5.4% |
| e | 1 | 2.7% |
| Other values (11) | 11 |
Common
| Value | Count | Frequency (%) |
| 5 | ||
| , | 3 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 45 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 7 | |
| 5 | 11.1% | |
| n | 4 | 8.9% |
| , | 3 | 6.7% |
| i | 2 | 4.4% |
| f | 2 | 4.4% |
| u | 2 | 4.4% |
| d | 2 | 4.4% |
| N | 2 | 4.4% |
| t | 2 | 4.4% |
| Other values (13) | 14 |
materialSampleID
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 3814098 |
| Missing (%) | > 99.9% |
| Memory size | 29.1 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 13 |
| Mean length | 13 |
| Min length | 13 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | North America |
|---|
| Value | Count | Frequency (%) |
| north | 1 | |
| america | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| r | 2 | |
| N | 1 | |
| o | 1 | |
| t | 1 | |
| h | 1 | |
| 1 | ||
| A | 1 | |
| m | 1 | |
| e | 1 | |
| i | 1 | |
| Other values (2) | 2 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 10 | |
| Uppercase Letter | 2 | 15.4% |
| Space Separator | 1 | 7.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 2 | |
| o | 1 | |
| t | 1 | |
| h | 1 | |
| m | 1 | |
| e | 1 | |
| i | 1 | |
| c | 1 | |
| a | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 1 | |
| A | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 12 | |
| Common | 1 | 7.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 2 | |
| N | 1 | |
| o | 1 | |
| t | 1 | |
| h | 1 | |
| A | 1 | |
| m | 1 | |
| e | 1 | |
| i | 1 | |
| c | 1 |
Common
| Value | Count | Frequency (%) |
| 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 13 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| r | 2 | |
| N | 1 | |
| o | 1 | |
| t | 1 | |
| h | 1 | |
| 1 | ||
| A | 1 | |
| m | 1 | |
| e | 1 | |
| i | 1 | |
| Other values (2) | 2 |
eventType
Text
Missing 
| Distinct | 3 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 3814096 |
| Missing (%) | > 99.9% |
| Memory size | 29.1 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 8 |
| Mean length | 8.333333333 |
| Min length | 4 |
Unique
| Unique | 3 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | -15.6527 |
|---|---|
| 2nd row | Baffin Island |
| 3rd row | 5.83 |
| Value | Count | Frequency (%) |
| 15.6527 | 1 | |
| baffin | 1 | |
| island | 1 | |
| 5.83 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 5 | 3 | 12.0% |
| f | 2 | 8.0% |
| n | 2 | 8.0% |
| . | 2 | 8.0% |
| a | 2 | 8.0% |
| 8 | 1 | 4.0% |
| d | 1 | 4.0% |
| l | 1 | 4.0% |
| s | 1 | 4.0% |
| I | 1 | 4.0% |
| Other values (9) | 9 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 10 | |
| Decimal Number | 9 | |
| Other Punctuation | 2 | 8.0% |
| Uppercase Letter | 2 | 8.0% |
| Space Separator | 1 | 4.0% |
| Dash Punctuation | 1 | 4.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 5 | 3 | |
| 8 | 1 | 11.1% |
| 1 | 1 | 11.1% |
| 7 | 1 | 11.1% |
| 2 | 1 | 11.1% |
| 6 | 1 | 11.1% |
| 3 | 1 | 11.1% |
Lowercase Letter
| Value | Count | Frequency (%) |
| f | 2 | |
| n | 2 | |
| a | 2 | |
| d | 1 | |
| l | 1 | |
| s | 1 | |
| i | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 1 | |
| B | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 2 |
Space Separator
| Value | Count | Frequency (%) |
| 1 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 13 | |
| Latin | 12 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 5 | 3 | |
| . | 2 | |
| 8 | 1 | 7.7% |
| 1 | 7.7% | |
| - | 1 | 7.7% |
| 1 | 1 | 7.7% |
| 7 | 1 | 7.7% |
| 2 | 1 | 7.7% |
| 6 | 1 | 7.7% |
| 3 | 1 | 7.7% |
Latin
| Value | Count | Frequency (%) |
| f | 2 | |
| n | 2 | |
| a | 2 | |
| d | 1 | |
| l | 1 | |
| s | 1 | |
| I | 1 | |
| i | 1 | |
| B | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 25 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 5 | 3 | 12.0% |
| f | 2 | 8.0% |
| n | 2 | 8.0% |
| . | 2 | 8.0% |
| a | 2 | 8.0% |
| 8 | 1 | 4.0% |
| d | 1 | 4.0% |
| l | 1 | 4.0% |
| s | 1 | 4.0% |
| I | 1 | 4.0% |
| Other values (9) | 9 |
fieldNumber
Text
Missing 
| Distinct | 60674 |
|---|---|
| Distinct (%) | 19.1% |
| Missing | 3496495 |
| Missing (%) | 91.7% |
| Memory size | 29.1 MiB |
Length
| Max length | 97 |
|---|---|
| Median length | 64 |
| Mean length | 12.75618065 |
| Min length | 1 |
Unique
| Unique | 27322 ? |
|---|---|
| Unique (%) | 8.6% |
Sample
| 1st row | MMS-MAMES/B3:M4-4 |
|---|---|
| 2nd row | USARP/EL/9/740/USC |
| 3rd row | M165503; H.29-118 |
| 4th row | USFC/A5151 |
| 5th row | USARP/EL/6/369/USC |
| Value | Count | Frequency (%) |
| vgs | 7929 | 1.9% |
| mms-mafla/jar | 7004 | 1.7% |
| jtw | 5892 | 1.4% |
| bolland/rfb | 3098 | 0.7% |
| bbc | 2577 | 0.6% |
| 2230 | 0.5% | |
| humes | 2193 | 0.5% |
| jpem | 2085 | 0.5% |
| lwk | 1727 | 0.4% |
| lk | 1719 | 0.4% |
| Other values (57021) | 377011 |
Most occurring characters
| Value | Count | Frequency (%) |
| / | 305500 | 7.5% |
| S | 290933 | 7.2% |
| - | 276981 | 6.8% |
| 1 | 218982 | 5.4% |
| M | 216754 | 5.4% |
| 0 | 202603 | 5.0% |
| A | 192694 | 4.8% |
| 2 | 191794 | 4.7% |
| C | 163005 | 4.0% |
| 3 | 134147 | 3.3% |
| Other values (74) | 1858021 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1835614 | |
| Decimal Number | 1403440 | |
| Other Punctuation | 360416 | 8.9% |
| Dash Punctuation | 276981 | 6.8% |
| Space Separator | 95861 | 2.4% |
| Lowercase Letter | 73645 | 1.8% |
| Connector Punctuation | 3087 | 0.1% |
| Close Punctuation | 1107 | < 0.1% |
| Open Punctuation | 1106 | < 0.1% |
| Math Symbol | 154 | < 0.1% |
| Other values (2) | 3 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 290933 | |
| M | 216754 | |
| A | 192694 | |
| C | 163005 | 8.9% |
| U | 116671 | 6.4% |
| F | 102454 | 5.6% |
| L | 84366 | 4.6% |
| I | 83059 | 4.5% |
| R | 80833 | 4.4% |
| B | 78620 | 4.3% |
| Other values (16) | 426225 |
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 11928 | |
| r | 11200 | |
| a | 10936 | |
| o | 5737 | |
| l | 4611 | 6.3% |
| i | 4055 | 5.5% |
| u | 3859 | 5.2% |
| s | 3658 | 5.0% |
| t | 3451 | 4.7% |
| m | 3145 | 4.3% |
| Other values (16) | 11065 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 305500 | |
| : | 33445 | 9.3% |
| ; | 15106 | 4.2% |
| . | 3639 | 1.0% |
| , | 1521 | 0.4% |
| # | 907 | 0.3% |
| \ | 150 | < 0.1% |
| ? | 57 | < 0.1% |
| & | 48 | < 0.1% |
| ' | 24 | < 0.1% |
| Other values (3) | 19 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 218982 | |
| 0 | 202603 | |
| 2 | 191794 | |
| 3 | 134147 | |
| 5 | 132922 | |
| 4 | 116940 | |
| 7 | 111223 | |
| 6 | 108012 | |
| 8 | 96459 | |
| 9 | 90358 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 148 | |
| = | 6 | 3.9% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 276981 |
Space Separator
| Value | Count | Frequency (%) |
| 95861 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 3087 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1107 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1106 |
Final Punctuation
| Value | Count | Frequency (%) |
| › | 2 |
Control
| Value | Count | Frequency (%) |
| | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2142155 | |
| Latin | 1909259 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| S | 290933 | |
| M | 216754 | |
| A | 192694 | 10.1% |
| C | 163005 | 8.5% |
| U | 116671 | 6.1% |
| F | 102454 | 5.4% |
| L | 84366 | 4.4% |
| I | 83059 | 4.4% |
| R | 80833 | 4.2% |
| B | 78620 | 4.1% |
| Other values (42) | 499870 |
Common
| Value | Count | Frequency (%) |
| / | 305500 | |
| - | 276981 | |
| 1 | 218982 | |
| 0 | 202603 | |
| 2 | 191794 | |
| 3 | 134147 | 6.3% |
| 5 | 132922 | 6.2% |
| 4 | 116940 | 5.5% |
| 7 | 111223 | 5.2% |
| 6 | 108012 | 5.0% |
| Other values (22) | 343051 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4051412 | |
| Punctuation | 2 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| / | 305500 | 7.5% |
| S | 290933 | 7.2% |
| - | 276981 | 6.8% |
| 1 | 218982 | 5.4% |
| M | 216754 | 5.4% |
| 0 | 202603 | 5.0% |
| A | 192694 | 4.8% |
| 2 | 191794 | 4.7% |
| C | 163005 | 4.0% |
| 3 | 134147 | 3.3% |
| Other values (73) | 1858019 |
Punctuation
| Value | Count | Frequency (%) |
| › | 2 |
eventDate
Text
Missing 
| Distinct | 94419 |
|---|---|
| Distinct (%) | 3.0% |
| Missing | 653351 |
| Missing (%) | 17.1% |
| Memory size | 29.1 MiB |
Length
| Max length | 24 |
|---|---|
| Median length | 10 |
| Mean length | 10.11858142 |
| Min length | 4 |
Unique
| Unique | 21671 ? |
|---|---|
| Unique (%) | 0.7% |
Sample
| 1st row | 1981-04-24 |
|---|---|
| 2nd row | 1952-03-30 |
| 3rd row | 1958-08-06 |
| 4th row | 1900-11 |
| 5th row | 1988-08-20 |
| Value | Count | Frequency (%) |
| or | 3309 | 0.1% |
| 1838/1842 | 3220 | 0.1% |
| 1915 | 3013 | 0.1% |
| 1913 | 2523 | 0.1% |
| 1982-07-21 | 2373 | 0.1% |
| 1891 | 2257 | 0.1% |
| 1981-07-06 | 2204 | 0.1% |
| 1983-05-13 | 2158 | 0.1% |
| 1982-11-19 | 2081 | 0.1% |
| 1916 | 2046 | 0.1% |
| Other values (92244) | 3142184 |
Most occurring characters
| Value | Count | Frequency (%) |
| - | 6096876 | |
| 1 | 6057096 | |
| 0 | 4860793 | |
| 9 | 4024283 | |
| 2 | 2319326 | 7.3% |
| 8 | 1867324 | 5.8% |
| 7 | 1478663 | 4.6% |
| 6 | 1473117 | 4.6% |
| 3 | 1249910 | 3.9% |
| 5 | 1205209 | 3.8% |
| Other values (14) | 1349689 | 4.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 25667726 | |
| Dash Punctuation | 6096876 | 19.1% |
| Other Punctuation | 204437 | 0.6% |
| Space Separator | 6620 | < 0.1% |
| Lowercase Letter | 6618 | < 0.1% |
| Uppercase Letter | 7 | < 0.1% |
| Open Punctuation | 1 | < 0.1% |
| Close Punctuation | 1 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 6057096 | |
| 0 | 4860793 | |
| 9 | 4024283 | |
| 2 | 2319326 | 9.0% |
| 8 | 1867324 | 7.3% |
| 7 | 1478663 | 5.8% |
| 6 | 1473117 | 5.7% |
| 3 | 1249910 | 4.9% |
| 5 | 1205209 | 4.7% |
| 4 | 1132005 | 4.4% |
Uppercase Letter
| Value | Count | Frequency (%) |
| G | 2 | |
| S | 2 | |
| W | 1 | |
| E | 1 | |
| P | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 201012 | |
| , | 3424 | 1.7% |
| : | 1 | < 0.1% |
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 3309 | |
| r | 3309 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 6096876 |
Space Separator
| Value | Count | Frequency (%) |
| 6620 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 31975661 | |
| Latin | 6625 | < 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| - | 6096876 | |
| 1 | 6057096 | |
| 0 | 4860793 | |
| 9 | 4024283 | |
| 2 | 2319326 | 7.3% |
| 8 | 1867324 | 5.8% |
| 7 | 1478663 | 4.6% |
| 6 | 1473117 | 4.6% |
| 3 | 1249910 | 3.9% |
| 5 | 1205209 | 3.8% |
| Other values (7) | 1343064 | 4.2% |
Latin
| Value | Count | Frequency (%) |
| o | 3309 | |
| r | 3309 | |
| G | 2 | < 0.1% |
| S | 2 | < 0.1% |
| W | 1 | < 0.1% |
| E | 1 | < 0.1% |
| P | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 31982286 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| - | 6096876 | |
| 1 | 6057096 | |
| 0 | 4860793 | |
| 9 | 4024283 | |
| 2 | 2319326 | 7.3% |
| 8 | 1867324 | 5.8% |
| 7 | 1478663 | 4.6% |
| 6 | 1473117 | 4.6% |
| 3 | 1249910 | 3.9% |
| 5 | 1205209 | 3.8% |
| Other values (14) | 1349689 | 4.2% |
eventTime
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 3814098 |
| Missing (%) | > 99.9% |
| Memory size | 29.1 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Nunavut |
|---|
| Value | Count | Frequency (%) |
| nunavut | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| u | 2 | |
| N | 1 | |
| n | 1 | |
| a | 1 | |
| v | 1 | |
| t | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 6 | |
| Uppercase Letter | 1 | 14.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| u | 2 | |
| n | 1 | |
| a | 1 | |
| v | 1 | |
| t | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 7 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| u | 2 | |
| N | 1 | |
| n | 1 | |
| a | 1 | |
| v | 1 | |
| t | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| u | 2 | |
| N | 1 | |
| n | 1 | |
| a | 1 | |
| v | 1 | |
| t | 1 |
startDayOfYear
Text
Missing 
| Distinct | 366 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 806907 |
| Missing (%) | 21.2% |
| Memory size | 29.1 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 2.772580866 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 114 |
|---|---|
| 2nd row | 90 |
| 3rd row | 218 |
| 4th row | 334 |
| 5th row | 233 |
| Value | Count | Frequency (%) |
| 212 | 37593 | 1.3% |
| 243 | 33383 | 1.1% |
| 181 | 32170 | 1.1% |
| 151 | 30939 | 1.0% |
| 120 | 24240 | 0.8% |
| 213 | 22295 | 0.7% |
| 273 | 20900 | 0.7% |
| 90 | 20298 | 0.7% |
| 334 | 19034 | 0.6% |
| 304 | 18795 | 0.6% |
| Other values (356) | 2747545 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 1695498 | |
| 2 | 1611644 | |
| 3 | 1027787 | |
| 4 | 641030 | 7.7% |
| 5 | 618209 | 7.4% |
| 0 | 582501 | 7.0% |
| 6 | 556510 | 6.7% |
| 9 | 547129 | 6.6% |
| 8 | 530041 | 6.4% |
| 7 | 527334 | 6.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 8337683 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1695498 | |
| 2 | 1611644 | |
| 3 | 1027787 | |
| 4 | 641030 | 7.7% |
| 5 | 618209 | 7.4% |
| 0 | 582501 | 7.0% |
| 6 | 556510 | 6.7% |
| 9 | 547129 | 6.6% |
| 8 | 530041 | 6.4% |
| 7 | 527334 | 6.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 8337683 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 1695498 | |
| 2 | 1611644 | |
| 3 | 1027787 | |
| 4 | 641030 | 7.7% |
| 5 | 618209 | 7.4% |
| 0 | 582501 | 7.0% |
| 6 | 556510 | 6.7% |
| 9 | 547129 | 6.6% |
| 8 | 530041 | 6.4% |
| 7 | 527334 | 6.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8337683 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 1695498 | |
| 2 | 1611644 | |
| 3 | 1027787 | |
| 4 | 641030 | 7.7% |
| 5 | 618209 | 7.4% |
| 0 | 582501 | 7.0% |
| 6 | 556510 | 6.7% |
| 9 | 547129 | 6.6% |
| 8 | 530041 | 6.4% |
| 7 | 527334 | 6.3% |
endDayOfYear
Text
Missing 
| Distinct | 366 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 805827 |
| Missing (%) | 21.1% |
| Memory size | 29.1 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 2.773799377 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 114 |
|---|---|
| 2nd row | 90 |
| 3rd row | 218 |
| 4th row | 334 |
| 5th row | 233 |
| Value | Count | Frequency (%) |
| 212 | 38231 | 1.3% |
| 243 | 35053 | 1.2% |
| 181 | 32432 | 1.1% |
| 151 | 29225 | 1.0% |
| 120 | 24168 | 0.8% |
| 273 | 21866 | 0.7% |
| 90 | 21275 | 0.7% |
| 213 | 21123 | 0.7% |
| 334 | 19989 | 0.7% |
| 304 | 19955 | 0.7% |
| Other values (356) | 2744955 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 1686177 | |
| 2 | 1613053 | |
| 3 | 1036459 | |
| 4 | 647209 | 7.8% |
| 5 | 617934 | 7.4% |
| 0 | 584245 | 7.0% |
| 6 | 553055 | 6.6% |
| 9 | 545035 | 6.5% |
| 8 | 531209 | 6.4% |
| 7 | 529967 | 6.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 8344343 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1686177 | |
| 2 | 1613053 | |
| 3 | 1036459 | |
| 4 | 647209 | 7.8% |
| 5 | 617934 | 7.4% |
| 0 | 584245 | 7.0% |
| 6 | 553055 | 6.6% |
| 9 | 545035 | 6.5% |
| 8 | 531209 | 6.4% |
| 7 | 529967 | 6.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 8344343 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 1686177 | |
| 2 | 1613053 | |
| 3 | 1036459 | |
| 4 | 647209 | 7.8% |
| 5 | 617934 | 7.4% |
| 0 | 584245 | 7.0% |
| 6 | 553055 | 6.6% |
| 9 | 545035 | 6.5% |
| 8 | 531209 | 6.4% |
| 7 | 529967 | 6.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8344343 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 1686177 | |
| 2 | 1613053 | |
| 3 | 1036459 | |
| 4 | 647209 | 7.8% |
| 5 | 617934 | 7.4% |
| 0 | 584245 | 7.0% |
| 6 | 553055 | 6.6% |
| 9 | 545035 | 6.5% |
| 8 | 531209 | 6.4% |
| 7 | 529967 | 6.4% |
year
Text
Missing 
| Distinct | 322 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 653351 |
| Missing (%) | 17.1% |
| Memory size | 29.1 MiB |
Length
| Max length | 69 |
|---|---|
| Median length | 4 |
| Mean length | 4.000020565 |
| Min length | 4 |
Unique
| Unique | 48 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 1981 |
|---|---|
| 2nd row | 1952 |
| 3rd row | 1958 |
| 4th row | 1900 |
| 5th row | 1988 |
| Value | Count | Frequency (%) |
| 1966 | 58752 | 1.9% |
| 1967 | 54040 | 1.7% |
| 1964 | 53626 | 1.7% |
| 1977 | 51065 | 1.6% |
| 1968 | 50503 | 1.6% |
| 1965 | 47521 | 1.5% |
| 1969 | 45348 | 1.4% |
| 1963 | 41039 | 1.3% |
| 1970 | 40287 | 1.3% |
| 1971 | 40050 | 1.3% |
| Other values (319) | 2678524 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 3560563 | |
| 9 | 3272320 | |
| 8 | 1124797 | 8.9% |
| 0 | 820664 | 6.5% |
| 6 | 800837 | 6.3% |
| 7 | 730947 | 5.8% |
| 2 | 679649 | 5.4% |
| 5 | 556593 | 4.4% |
| 4 | 552089 | 4.4% |
| 3 | 544529 | 4.3% |
| Other values (26) | 69 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 12642988 | |
| Lowercase Letter | 52 | < 0.1% |
| Space Separator | 7 | < 0.1% |
| Uppercase Letter | 7 | < 0.1% |
| Other Punctuation | 1 | < 0.1% |
| Open Punctuation | 1 | < 0.1% |
| Close Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 11 | |
| i | 6 | |
| t | 5 | |
| e | 4 | 7.7% |
| o | 4 | 7.7% |
| n | 4 | 7.7% |
| a | 3 | 5.8% |
| s | 3 | 5.8% |
| b | 2 | 3.8% |
| l | 2 | 3.8% |
| Other values (7) | 8 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 3560563 | |
| 9 | 3272320 | |
| 8 | 1124797 | 8.9% |
| 0 | 820664 | 6.5% |
| 6 | 800837 | 6.3% |
| 7 | 730947 | 5.8% |
| 2 | 679649 | 5.4% |
| 5 | 556593 | 4.4% |
| 4 | 552089 | 4.4% |
| 3 | 544529 | 4.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| F | 2 | |
| D | 2 | |
| T | 1 | |
| N | 1 | |
| H | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 7 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 1 |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 1 |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 12642998 | |
| Latin | 59 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 11 | |
| i | 6 | 10.2% |
| t | 5 | 8.5% |
| e | 4 | 6.8% |
| o | 4 | 6.8% |
| n | 4 | 6.8% |
| a | 3 | 5.1% |
| s | 3 | 5.1% |
| F | 2 | 3.4% |
| b | 2 | 3.4% |
| Other values (12) | 15 |
Common
| Value | Count | Frequency (%) |
| 1 | 3560563 | |
| 9 | 3272320 | |
| 8 | 1124797 | 8.9% |
| 0 | 820664 | 6.5% |
| 6 | 800837 | 6.3% |
| 7 | 730947 | 5.8% |
| 2 | 679649 | 5.4% |
| 5 | 556593 | 4.4% |
| 4 | 552089 | 4.4% |
| 3 | 544529 | 4.3% |
| Other values (4) | 10 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 12643057 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 3560563 | |
| 9 | 3272320 | |
| 8 | 1124797 | 8.9% |
| 0 | 820664 | 6.5% |
| 6 | 800837 | 6.3% |
| 7 | 730947 | 5.8% |
| 2 | 679649 | 5.4% |
| 5 | 556593 | 4.4% |
| 4 | 552089 | 4.4% |
| 3 | 544529 | 4.3% |
| Other values (26) | 69 | < 0.1% |
month
Text
Missing 
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 799915 |
| Missing (%) | 21.0% |
| Memory size | 29.1 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 1 |
| Mean length | 1.174196068 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 4 |
|---|---|
| 2nd row | 3 |
| 3rd row | 8 |
| 4th row | 11 |
| 5th row | 8 |
| Value | Count | Frequency (%) |
| 7 | 391749 | |
| 8 | 361170 | |
| 6 | 326519 | |
| 5 | 309261 | |
| 4 | 251357 | |
| 9 | 250756 | |
| 3 | 228476 | |
| 10 | 201213 | |
| 2 | 197532 | |
| 11 | 182252 | |
| Other values (2) | 313899 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 879616 | |
| 7 | 391749 | |
| 8 | 361170 | |
| 2 | 339126 | 9.6% |
| 6 | 326519 | 9.2% |
| 5 | 309261 | 8.7% |
| 4 | 251357 | 7.1% |
| 9 | 250756 | 7.1% |
| 3 | 228476 | 6.5% |
| 0 | 201213 | 5.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3539243 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 879616 | |
| 7 | 391749 | |
| 8 | 361170 | |
| 2 | 339126 | 9.6% |
| 6 | 326519 | 9.2% |
| 5 | 309261 | 8.7% |
| 4 | 251357 | 7.1% |
| 9 | 250756 | 7.1% |
| 3 | 228476 | 6.5% |
| 0 | 201213 | 5.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3539243 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 879616 | |
| 7 | 391749 | |
| 8 | 361170 | |
| 2 | 339126 | 9.6% |
| 6 | 326519 | 9.2% |
| 5 | 309261 | 8.7% |
| 4 | 251357 | 7.1% |
| 9 | 250756 | 7.1% |
| 3 | 228476 | 6.5% |
| 0 | 201213 | 5.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3539243 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 879616 | |
| 7 | 391749 | |
| 8 | 361170 | |
| 2 | 339126 | 9.6% |
| 6 | 326519 | 9.2% |
| 5 | 309261 | 8.7% |
| 4 | 251357 | 7.1% |
| 9 | 250756 | 7.1% |
| 3 | 228476 | 6.5% |
| 0 | 201213 | 5.7% |
day
Text
Missing 
| Distinct | 31 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1074234 |
| Missing (%) | 28.2% |
| Memory size | 29.1 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 1.706628246 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 24 |
|---|---|
| 2nd row | 30 |
| 3rd row | 6 |
| 4th row | 20 |
| 5th row | 8 |
| Value | Count | Frequency (%) |
| 15 | 98711 | 3.6% |
| 10 | 98499 | 3.6% |
| 20 | 97124 | 3.5% |
| 1 | 94601 | 3.5% |
| 19 | 94164 | 3.4% |
| 8 | 93400 | 3.4% |
| 13 | 92580 | 3.4% |
| 18 | 92566 | 3.4% |
| 21 | 92198 | 3.4% |
| 25 | 91203 | 3.3% |
| Other values (21) | 1794819 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 1245266 | |
| 2 | 1151590 | |
| 3 | 393455 | 8.4% |
| 5 | 279261 | 6.0% |
| 0 | 274680 | 5.9% |
| 8 | 274022 | 5.9% |
| 6 | 267711 | 5.7% |
| 7 | 265434 | 5.7% |
| 4 | 264266 | 5.7% |
| 9 | 260246 | 5.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 4675931 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1245266 | |
| 2 | 1151590 | |
| 3 | 393455 | 8.4% |
| 5 | 279261 | 6.0% |
| 0 | 274680 | 5.9% |
| 8 | 274022 | 5.9% |
| 6 | 267711 | 5.7% |
| 7 | 265434 | 5.7% |
| 4 | 264266 | 5.7% |
| 9 | 260246 | 5.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4675931 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 1245266 | |
| 2 | 1151590 | |
| 3 | 393455 | 8.4% |
| 5 | 279261 | 6.0% |
| 0 | 274680 | 5.9% |
| 8 | 274022 | 5.9% |
| 6 | 267711 | 5.7% |
| 7 | 265434 | 5.7% |
| 4 | 264266 | 5.7% |
| 9 | 260246 | 5.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4675931 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 1245266 | |
| 2 | 1151590 | |
| 3 | 393455 | 8.4% |
| 5 | 279261 | 6.0% |
| 0 | 274680 | 5.9% |
| 8 | 274022 | 5.9% |
| 6 | 267711 | 5.7% |
| 7 | 265434 | 5.7% |
| 4 | 264266 | 5.7% |
| 9 | 260246 | 5.6% |
Missing 
| Distinct | 221213 |
|---|---|
| Distinct (%) | 12.4% |
| Missing | 2027788 |
| Missing (%) | 53.2% |
| Memory size | 29.1 MiB |
Length
| Max length | 194 |
|---|---|
| Median length | 11 |
| Mean length | 13.22104046 |
| Min length | 1 |
Unique
| Unique | 88101 ? |
|---|---|
| Unique (%) | 4.9% |
Sample
| 1st row | 24 APR 1981 |
|---|---|
| 2nd row | 6 Aug 1958 |
| 3rd row | 24 Jun 1934 |
| 4th row | 24 Mar 1974 |
| 5th row | 23-29 January 1885 |
| Value | Count | Frequency (%) |
| 704830 | 11.7% | |
| 00 | 328903 | 5.5% |
| 0000 | 154437 | 2.6% |
| aug | 152852 | 2.5% |
| may | 151178 | 2.5% |
| jul | 150920 | 2.5% |
| jun | 135719 | 2.3% |
| apr | 125753 | 2.1% |
| mar | 116184 | 1.9% |
| sep | 109403 | 1.8% |
| Other values (61813) | 3870262 |
Most occurring characters
| Value | Count | Frequency (%) |
| 4214130 | ||
| 1 | 2567975 | 10.9% |
| 0 | 2139022 | 9.1% |
| 9 | 1866296 | 7.9% |
| - | 1720550 | 7.3% |
| 2 | 976132 | 4.1% |
| 8 | 743384 | 3.1% |
| 6 | 653108 | 2.8% |
| 7 | 593250 | 2.5% |
| 3 | 531644 | 2.3% |
| Other values (96) | 7611399 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 11001535 | |
| Space Separator | 4214130 | 17.8% |
| Lowercase Letter | 3904026 | 16.5% |
| Uppercase Letter | 2168227 | 9.2% |
| Dash Punctuation | 1720560 | 7.3% |
| Other Punctuation | 577783 | 2.4% |
| Open Punctuation | 15135 | 0.1% |
| Close Punctuation | 15132 | 0.1% |
| Connector Punctuation | 190 | < 0.1% |
| Math Symbol | 165 | < 0.1% |
| Other values (4) | 7 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| u | 423036 | |
| a | 412743 | |
| r | 411061 | |
| e | 384156 | 9.8% |
| n | 288728 | 7.4% |
| c | 225164 | 5.8% |
| p | 222891 | 5.7% |
| y | 215954 | 5.5% |
| t | 190592 | 4.9% |
| b | 168645 | 4.3% |
| Other values (23) | 961056 |
Uppercase Letter
| Value | Count | Frequency (%) |
| J | 406715 | |
| A | 362558 | |
| M | 284047 | |
| N | 147056 | 6.8% |
| S | 138874 | 6.4% |
| O | 125338 | 5.8% |
| F | 111776 | 5.2% |
| T | 86104 | 4.0% |
| U | 75652 | 3.5% |
| D | 75433 | 3.5% |
| Other values (18) | 354674 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 290713 | |
| : | 161224 | |
| ; | 52096 | 9.0% |
| . | 44672 | 7.7% |
| , | 24797 | 4.3% |
| ' | 2444 | 0.4% |
| * | 966 | 0.2% |
| ? | 463 | 0.1% |
| ! | 208 | < 0.1% |
| & | 155 | < 0.1% |
| Other values (4) | 45 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 2567975 | |
| 0 | 2139022 | |
| 9 | 1866296 | |
| 2 | 976132 | 8.9% |
| 8 | 743384 | 6.8% |
| 6 | 653108 | 5.9% |
| 7 | 593250 | 5.4% |
| 3 | 531644 | 4.8% |
| 5 | 479032 | 4.4% |
| 4 | 451692 | 4.1% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 74 | |
| | | 72 | |
| = | 12 | 7.3% |
| ~ | 3 | 1.8% |
| < | 2 | 1.2% |
| ± | 2 | 1.2% |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 13901 | |
| ( | 1231 | 8.1% |
| { | 3 | < 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 13898 | |
| ) | 1231 | 8.1% |
| } | 3 | < 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1720550 | |
| – | 10 | < 0.1% |
Format
| Value | Count | Frequency (%) |
| | 2 | |
| | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 4214130 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 190 |
Other Number
| Value | Count | Frequency (%) |
| ½ | 2 |
Other Symbol
| Value | Count | Frequency (%) |
| ° | 1 |
Other Letter
| Value | Count | Frequency (%) |
| º | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 17544636 | |
| Latin | 6072254 | 25.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| u | 423036 | 7.0% |
| a | 412743 | 6.8% |
| r | 411061 | 6.8% |
| J | 406715 | 6.7% |
| e | 384156 | 6.3% |
| A | 362558 | 6.0% |
| n | 288728 | 4.8% |
| M | 284047 | 4.7% |
| c | 225164 | 3.7% |
| p | 222891 | 3.7% |
| Other values (52) | 2651155 |
Common
| Value | Count | Frequency (%) |
| 4214130 | ||
| 1 | 2567975 | |
| 0 | 2139022 | |
| 9 | 1866296 | |
| - | 1720550 | |
| 2 | 976132 | 5.6% |
| 8 | 743384 | 4.2% |
| 6 | 653108 | 3.7% |
| 7 | 593250 | 3.4% |
| 3 | 531644 | 3.0% |
| Other values (34) | 1539145 | 8.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 23616832 | |
| None | 45 | < 0.1% |
| Punctuation | 13 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 4214130 | ||
| 1 | 2567975 | 10.9% |
| 0 | 2139022 | 9.1% |
| 9 | 1866296 | 7.9% |
| - | 1720550 | 7.3% |
| 2 | 976132 | 4.1% |
| 8 | 743384 | 3.1% |
| 6 | 653108 | 2.8% |
| 7 | 593250 | 2.5% |
| 3 | 531644 | 2.3% |
| Other values (79) | 7611341 |
None
| Value | Count | Frequency (%) |
| é | 16 | |
| û | 8 | |
| ü | 4 | 8.9% |
| ô | 3 | 6.7% |
| ± | 2 | 4.4% |
| ä | 2 | 4.4% |
| Æ | 2 | 4.4% |
| ½ | 2 | 4.4% |
| ° | 1 | 2.2% |
| º | 1 | 2.2% |
| Other values (4) | 4 | 8.9% |
Punctuation
| Value | Count | Frequency (%) |
| – | 10 | |
| | 2 | 15.4% |
| … | 1 | 7.7% |
habitat
Text
Missing 
| Distinct | 106103 |
|---|---|
| Distinct (%) | 35.6% |
| Missing | 3516278 |
| Missing (%) | 92.2% |
| Memory size | 29.1 MiB |
Length
| Max length | 37931 |
|---|---|
| Median length | 533 |
| Mean length | 30.99341215 |
| Min length | 1 |
Unique
| Unique | 82257 ? |
|---|---|
| Unique (%) | 27.6% |
Sample
| 1st row | abandoned field |
|---|---|
| 2nd row | In wet mixed hardwood-pine-podocarpus forest. |
| 3rd row | Ecological remarks by collector(s): yes |
| 4th row | Rainforest |
| 5th row | Tropical dry forest |
| Value | Count | Frequency (%) |
| forest | 70097 | 5.0% |
| on | 40365 | 2.9% |
| and | 34529 | 2.5% |
| in | 33893 | 2.4% |
| with | 24655 | 1.8% |
| of | 24444 | 1.7% |
| by | 24094 | 1.7% |
| remarks | 20080 | 1.4% |
| ecological | 20077 | 1.4% |
| collector(s | 20073 | 1.4% |
| Other values (31427) | 1094865 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1107679 | 12.0% | |
| e | 823152 | 8.9% |
| a | 709570 | 7.7% |
| o | 679364 | 7.4% |
| r | 608992 | 6.6% |
| s | 585109 | 6.3% |
| n | 520012 | 5.6% |
| i | 459827 | 5.0% |
| t | 445145 | 4.8% |
| l | 408741 | 4.4% |
| Other values (132) | 2882898 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7389357 | |
| Space Separator | 1107679 | 12.0% |
| Uppercase Letter | 394546 | 4.3% |
| Other Punctuation | 229067 | 2.5% |
| Decimal Number | 29101 | 0.3% |
| Close Punctuation | 25311 | 0.3% |
| Open Punctuation | 25287 | 0.3% |
| Dash Punctuation | 18502 | 0.2% |
| Control | 9139 | 0.1% |
| Math Symbol | 2404 | < 0.1% |
| Other values (8) | 96 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 823152 | |
| a | 709570 | 9.6% |
| o | 679364 | 9.2% |
| r | 608992 | 8.2% |
| s | 585109 | 7.9% |
| n | 520012 | 7.0% |
| i | 459827 | 6.2% |
| t | 445145 | 6.0% |
| l | 408741 | 5.5% |
| d | 328756 | 4.4% |
| Other values (45) | 1820689 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 44502 | 11.3% |
| E | 32517 | 8.2% |
| M | 31497 | 8.0% |
| C | 24824 | 6.3% |
| R | 24221 | 6.1% |
| P | 23660 | 6.0% |
| O | 22747 | 5.8% |
| F | 21984 | 5.6% |
| A | 21935 | 5.6% |
| T | 21053 | 5.3% |
| Other values (21) | 125606 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 92681 | |
| . | 88295 | |
| : | 23067 | 10.1% |
| ; | 12709 | 5.5% |
| & | 4733 | 2.1% |
| / | 3557 | 1.6% |
| " | 1718 | 0.7% |
| ' | 1153 | 0.5% |
| ? | 433 | 0.2% |
| % | 339 | 0.1% |
| Other values (6) | 382 | 0.2% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 7724 | |
| 1 | 4005 | |
| 2 | 3469 | |
| 3 | 3407 | |
| 5 | 3160 | |
| 4 | 2310 | 7.9% |
| 6 | 1580 | 5.4% |
| 8 | 1334 | 4.6% |
| 9 | 1101 | 3.8% |
| 7 | 1011 | 3.5% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 1170 | |
| ~ | 833 | |
| | | 204 | 8.5% |
| = | 97 | 4.0% |
| ± | 71 | 3.0% |
| < | 16 | 0.7% |
| > | 12 | 0.5% |
| × | 1 | < 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 24930 | |
| ] | 297 | 1.2% |
| } | 84 | 0.3% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 24915 | |
| [ | 288 | 1.1% |
| { | 84 | 0.3% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 18482 | |
| – | 12 | 0.1% |
| — | 8 | < 0.1% |
Control
| Value | Count | Frequency (%) |
| 9091 | ||
| 48 | 0.5% |
Other Symbol
| Value | Count | Frequency (%) |
| ° | 51 | |
| ¦ | 1 | 1.9% |
Final Punctuation
| Value | Count | Frequency (%) |
| ” | 11 | |
| › | 2 | 15.4% |
Space Separator
| Value | Count | Frequency (%) |
| 1107679 |
Other Letter
| Value | Count | Frequency (%) |
| º | 11 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 8 |
Initial Punctuation
| Value | Count | Frequency (%) |
| “ | 8 |
Currency Symbol
| Value | Count | Frequency (%) |
| £ | 2 |
Other Number
| Value | Count | Frequency (%) |
| ² | 1 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ´ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 7783914 | |
| Common | 1446575 | 15.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 823152 | 10.6% |
| a | 709570 | 9.1% |
| o | 679364 | 8.7% |
| r | 608992 | 7.8% |
| s | 585109 | 7.5% |
| n | 520012 | 6.7% |
| i | 459827 | 5.9% |
| t | 445145 | 5.7% |
| l | 408741 | 5.3% |
| d | 328756 | 4.2% |
| Other values (77) | 2215246 |
Common
| Value | Count | Frequency (%) |
| 1107679 | ||
| , | 92681 | 6.4% |
| . | 88295 | 6.1% |
| ) | 24930 | 1.7% |
| ( | 24915 | 1.7% |
| : | 23067 | 1.6% |
| - | 18482 | 1.3% |
| ; | 12709 | 0.9% |
| 9091 | 0.6% | |
| 0 | 7724 | 0.5% |
| Other values (45) | 37002 | 2.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9217756 | |
| None | 12632 | 0.1% |
| Punctuation | 101 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1107679 | 12.0% | |
| e | 823152 | 8.9% |
| a | 709570 | 7.7% |
| o | 679364 | 7.4% |
| r | 608992 | 6.6% |
| s | 585109 | 6.3% |
| n | 520012 | 5.6% |
| i | 459827 | 5.0% |
| t | 445145 | 4.8% |
| l | 408741 | 4.4% |
| Other values (84) | 2870165 |
None
| Value | Count | Frequency (%) |
| ú | 1917 | |
| ê | 1816 | |
| é | 1812 | |
| ó | 1726 | |
| í | 1471 | |
| á | 1331 | |
| ñ | 1008 | |
| è | 660 | 5.2% |
| à | 228 | 1.8% |
| ç | 92 | 0.7% |
| Other values (32) | 571 | 4.5% |
Punctuation
| Value | Count | Frequency (%) |
| … | 60 | |
| – | 12 | 11.9% |
| ” | 11 | 10.9% |
| — | 8 | 7.9% |
| “ | 8 | 7.9% |
| › | 2 | 2.0% |
sampleSizeValue
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 3814098 |
| Missing (%) | > 99.9% |
| Memory size | 29.1 MiB |
Length
| Max length | 6 |
|---|---|
| Median length | 6 |
| Mean length | 6 |
| Min length | 6 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 1000.0 |
|---|
| Value | Count | Frequency (%) |
| 1000.0 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 4 | |
| 1 | 1 | 16.7% |
| . | 1 | 16.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 5 | |
| Other Punctuation | 1 | 16.7% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 4 | |
| 1 | 1 | 20.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 6 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 4 | |
| 1 | 1 | 16.7% |
| . | 1 | 16.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 4 | |
| 1 | 1 | 16.7% |
| . | 1 | 16.7% |
eventRemarks
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 3814098 |
| Missing (%) | > 99.9% |
| Memory size | 29.1 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | GPS |
|---|
| Value | Count | Frequency (%) |
| gps | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| G | 1 | |
| P | 1 | |
| S | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 3 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| G | 1 | |
| P | 1 | |
| S | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| G | 1 | |
| P | 1 | |
| S | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| G | 1 | |
| P | 1 | |
| S | 1 |
locationID
Text
Missing 
| Distinct | 65557 |
|---|---|
| Distinct (%) | 14.7% |
| Missing | 3366761 |
| Missing (%) | 88.3% |
| Memory size | 29.1 MiB |
Length
| Max length | 49374 |
|---|---|
| Median length | 131 |
| Mean length | 4.671313414 |
| Min length | 1 |
Unique
| Unique | 36003 ? |
|---|---|
| Unique (%) | 8.0% |
Sample
| 1st row | 31 |
|---|---|
| 2nd row | GS 03383 |
| 3rd row | M4 |
| 4th row | 9 |
| 5th row | 68-36 |
| Value | Count | Frequency (%) |
| d | 5711 | 1.1% |
| not | 5048 | 1.0% |
| rec | 4891 | 1.0% |
| 4 | 3834 | 0.8% |
| 1 | 3635 | 0.7% |
| rhb | 3185 | 0.6% |
| rfb | 3103 | 0.6% |
| 2 | 2955 | 0.6% |
| 3 | 2478 | 0.5% |
| 6 | 2386 | 0.5% |
| Other values (55856) | 469594 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 229414 | 11.0% |
| 2 | 190821 | 9.1% |
| 0 | 156212 | 7.5% |
| - | 139092 | 6.7% |
| 5 | 138185 | 6.6% |
| 3 | 138042 | 6.6% |
| 4 | 132581 | 6.3% |
| 6 | 118976 | 5.7% |
| 7 | 95419 | 4.6% |
| 8 | 87223 | 4.2% |
| Other values (91) | 663691 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1366265 | |
| Uppercase Letter | 410919 | 19.7% |
| Dash Punctuation | 139094 | 6.7% |
| Lowercase Letter | 59759 | 2.9% |
| Space Separator | 57287 | 2.7% |
| Other Punctuation | 37590 | 1.8% |
| Control | 11792 | 0.6% |
| Connector Punctuation | 3314 | 0.2% |
| Open Punctuation | 1728 | 0.1% |
| Close Punctuation | 1547 | 0.1% |
| Other values (2) | 361 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 7316 | |
| t | 7236 | |
| o | 6746 | |
| e | 6311 | |
| i | 5591 | |
| n | 4601 | |
| r | 4425 | |
| l | 2509 | 4.2% |
| c | 2145 | 3.6% |
| u | 2062 | 3.5% |
| Other values (26) | 10817 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 43233 | 10.5% |
| S | 38365 | 9.3% |
| C | 32592 | 7.9% |
| B | 29728 | 7.2% |
| M | 26503 | 6.4% |
| R | 26076 | 6.3% |
| N | 24656 | 6.0% |
| E | 22030 | 5.4% |
| I | 20351 | 5.0% |
| T | 19501 | 4.7% |
| Other values (17) | 127884 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 16051 | |
| . | 13690 | |
| , | 3696 | 9.8% |
| / | 2453 | 6.5% |
| # | 688 | 1.8% |
| ; | 498 | 1.3% |
| & | 252 | 0.7% |
| ? | 153 | 0.4% |
| ' | 51 | 0.1% |
| * | 50 | 0.1% |
| Other values (4) | 8 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 229414 | |
| 2 | 190821 | |
| 0 | 156212 | |
| 5 | 138185 | |
| 3 | 138042 | |
| 4 | 132581 | |
| 6 | 118976 | |
| 7 | 95419 | |
| 8 | 87223 | 6.4% |
| 9 | 79392 | 5.8% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 315 | |
| = | 41 | 11.4% |
| | | 3 | 0.8% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 139092 | |
| – | 2 | < 0.1% |
Control
| Value | Count | Frequency (%) |
| 11730 | ||
| 62 | 0.5% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1599 | |
| [ | 129 | 7.5% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1418 | |
| ] | 129 | 8.3% |
Space Separator
| Value | Count | Frequency (%) |
| 57287 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 3314 |
Other Symbol
| Value | Count | Frequency (%) |
| ° | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1618978 | |
| Latin | 470678 | 22.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 43233 | 9.2% |
| S | 38365 | 8.2% |
| C | 32592 | 6.9% |
| B | 29728 | 6.3% |
| M | 26503 | 5.6% |
| R | 26076 | 5.5% |
| N | 24656 | 5.2% |
| E | 22030 | 4.7% |
| I | 20351 | 4.3% |
| T | 19501 | 4.1% |
| Other values (53) | 187643 |
Common
| Value | Count | Frequency (%) |
| 1 | 229414 | |
| 2 | 190821 | |
| 0 | 156212 | |
| - | 139092 | |
| 5 | 138185 | |
| 3 | 138042 | |
| 4 | 132581 | |
| 6 | 118976 | |
| 7 | 95419 | |
| 8 | 87223 | 5.4% |
| Other values (28) | 193013 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2089634 | |
| None | 20 | < 0.1% |
| Punctuation | 2 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 229414 | 11.0% |
| 2 | 190821 | 9.1% |
| 0 | 156212 | 7.5% |
| - | 139092 | 6.7% |
| 5 | 138185 | 6.6% |
| 3 | 138042 | 6.6% |
| 4 | 132581 | 6.3% |
| 6 | 118976 | 5.7% |
| 7 | 95419 | 4.6% |
| 8 | 87223 | 4.2% |
| Other values (78) | 663669 |
None
| Value | Count | Frequency (%) |
| ä | 3 | |
| é | 3 | |
| á | 2 | |
| í | 2 | |
| ° | 2 | |
| ü | 2 | |
| ã | 1 | 5.0% |
| å | 1 | 5.0% |
| ö | 1 | 5.0% |
| è | 1 | 5.0% |
| Other values (2) | 2 |
Punctuation
| Value | Count | Frequency (%) |
| – | 2 |
higherGeography
Text
Missing 
| Distinct | 56561 |
|---|---|
| Distinct (%) | 1.5% |
| Missing | 118692 |
| Missing (%) | 3.1% |
| Memory size | 29.1 MiB |
Length
| Max length | 177 |
|---|---|
| Median length | 138 |
| Mean length | 40.43501785 |
| Min length | 4 |
Unique
| Unique | 17764 ? |
|---|---|
| Unique (%) | 0.5% |
Sample
| 1st row | North Atlantic Ocean, Caribbean Sea, Belize |
|---|---|
| 2nd row | North America, United States, Tennessee |
| 3rd row | North America, United States, West Virginia, Randolph |
| 4th row | United States, Georgia, Decatur County |
| 5th row | North Atlantic Ocean, Gulf of Mexico, United States |
| Value | Count | Frequency (%) |
| america | 1838540 | 9.2% |
| north | 1785934 | 8.9% |
| united | 1390693 | 6.9% |
| states | 1378096 | 6.9% |
| 712178 | 3.6% | |
| south | 711734 | 3.5% |
| ocean | 694520 | 3.5% |
| neotropics | 659180 | 3.3% |
| atlantic | 362469 | 1.8% |
| pacific | 345353 | 1.7% |
| Other values (18600) | 10176334 |
Most occurring characters
| Value | Count | Frequency (%) |
| 16359624 | 10.9% | |
| a | 14443845 | 9.7% |
| i | 10914554 | 7.3% |
| e | 10709030 | 7.2% |
| t | 10494122 | 7.0% |
| r | 8233580 | 5.5% |
| o | 8056092 | 5.4% |
| , | 7690870 | 5.1% |
| n | 7438148 | 5.0% |
| c | 6015663 | 4.0% |
| Other values (176) | 49068320 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 104713822 | |
| Uppercase Letter | 19392029 | 13.0% |
| Space Separator | 16359624 | 10.9% |
| Other Punctuation | 7803699 | 5.2% |
| Dash Punctuation | 965382 | 0.6% |
| Open Punctuation | 94370 | 0.1% |
| Close Punctuation | 94348 | 0.1% |
| Modifier Letter | 221 | < 0.1% |
| Math Symbol | 170 | < 0.1% |
| Decimal Number | 111 | < 0.1% |
| Other values (3) | 72 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 14443845 | |
| i | 10914554 | |
| e | 10709030 | |
| t | 10494122 | |
| r | 8233580 | |
| o | 8056092 | |
| n | 7438148 | 7.1% |
| c | 6015663 | 5.7% |
| s | 5269580 | 5.0% |
| l | 3634298 | 3.5% |
| Other values (88) | 19504910 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 3309730 | |
| N | 2853613 | |
| S | 2781807 | |
| U | 1497885 | |
| C | 1419390 | |
| P | 1041591 | 5.4% |
| M | 903434 | 4.7% |
| O | 861595 | 4.4% |
| I | 651469 | 3.4% |
| B | 542484 | 2.8% |
| Other values (39) | 3529031 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 7690870 | |
| . | 70709 | 0.9% |
| ' | 28187 | 0.4% |
| / | 10594 | 0.1% |
| ? | 2720 | < 0.1% |
| ; | 452 | < 0.1% |
| & | 74 | < 0.1% |
| * | 46 | < 0.1% |
| : | 41 | < 0.1% |
| ¡ | 3 | < 0.1% |
| Other values (2) | 3 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 29 | |
| 2 | 27 | |
| 3 | 22 | |
| 0 | 17 | |
| 4 | 7 | 6.3% |
| 6 | 3 | 2.7% |
| 8 | 2 | 1.8% |
| 9 | 2 | 1.8% |
| 7 | 2 | 1.8% |
Math Symbol
| Value | Count | Frequency (%) |
| = | 161 | |
| + | 7 | 4.1% |
| | | 1 | 0.6% |
| ~ | 1 | 0.6% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 965160 | |
| – | 221 | < 0.1% |
| — | 1 | < 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 76332 | |
| ( | 18038 | 19.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 76308 | |
| ) | 18040 | 19.1% |
Modifier Letter
| Value | Count | Frequency (%) |
| ʻ | 194 | |
| ʼ | 27 | 12.2% |
Modifier Symbol
| Value | Count | Frequency (%) |
| ´ | 52 | |
| ¸ | 1 | 1.9% |
Space Separator
| Value | Count | Frequency (%) |
| 16359624 |
Format
| Value | Count | Frequency (%) |
| | 17 |
Nonspacing Mark
| Value | Count | Frequency (%) |
| ́ | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 124105851 | |
| Common | 25317995 | 16.9% |
| Inherited | 2 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 14443845 | 11.6% |
| i | 10914554 | 8.8% |
| e | 10709030 | 8.6% |
| t | 10494122 | 8.5% |
| r | 8233580 | 6.6% |
| o | 8056092 | 6.5% |
| n | 7438148 | 6.0% |
| c | 6015663 | 4.8% |
| s | 5269580 | 4.2% |
| l | 3634298 | 2.9% |
| Other values (137) | 38896939 |
Common
| Value | Count | Frequency (%) |
| 16359624 | ||
| , | 7690870 | |
| - | 965160 | 3.8% |
| [ | 76332 | 0.3% |
| ] | 76308 | 0.3% |
| . | 70709 | 0.3% |
| ' | 28187 | 0.1% |
| ) | 18040 | 0.1% |
| ( | 18038 | 0.1% |
| / | 10594 | < 0.1% |
| Other values (28) | 4133 | < 0.1% |
Inherited
| Value | Count | Frequency (%) |
| ́ | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 149209059 | |
| None | 214262 | 0.1% |
| Punctuation | 239 | < 0.1% |
| Modifier Letters | 221 | < 0.1% |
| Latin Ext Additional | 65 | < 0.1% |
| Diacriticals | 2 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 16359624 | 11.0% | |
| a | 14443845 | 9.7% |
| i | 10914554 | 7.3% |
| e | 10709030 | 7.2% |
| t | 10494122 | 7.0% |
| r | 8233580 | 5.5% |
| o | 8056092 | 5.4% |
| , | 7690870 | 5.2% |
| n | 7438148 | 5.0% |
| c | 6015663 | 4.0% |
| Other values (72) | 48853531 |
None
| Value | Count | Frequency (%) |
| á | 69039 | |
| í | 40375 | |
| é | 36868 | |
| ó | 26693 | 12.5% |
| ã | 13821 | 6.5% |
| ô | 6264 | 2.9% |
| ç | 3624 | 1.7% |
| ñ | 3240 | 1.5% |
| Î | 2734 | 1.3% |
| ü | 2671 | 1.2% |
| Other values (71) | 8933 | 4.2% |
Punctuation
| Value | Count | Frequency (%) |
| – | 221 | |
| | 17 | 7.1% |
| — | 1 | 0.4% |
Modifier Letters
| Value | Count | Frequency (%) |
| ʻ | 194 | |
| ʼ | 27 | 12.2% |
Latin Ext Additional
| Value | Count | Frequency (%) |
| ả | 18 | |
| ị | 10 | |
| ố | 6 | 9.2% |
| ộ | 5 | 7.7% |
| ḍ | 5 | 7.7% |
| ế | 4 | 6.2% |
| ừ | 3 | 4.6% |
| ậ | 3 | 4.6% |
| ṭ | 3 | 4.6% |
| ẵ | 1 | 1.5% |
| Other values (7) | 7 | 10.8% |
Diacriticals
| Value | Count | Frequency (%) |
| ́ | 2 |
continent
Text
Missing 
| Distinct | 195 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 534327 |
| Missing (%) | 14.0% |
| Memory size | 29.1 MiB |
Length
| Max length | 60 |
|---|---|
| Median length | 57 |
| Mean length | 16.28692421 |
| Min length | 4 |
Unique
| Unique | 33 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | North Atlantic Ocean |
|---|---|
| 2nd row | North America |
| 3rd row | North America |
| 4th row | North Atlantic Ocean |
| 5th row | Asia |
| Value | Count | Frequency (%) |
| america | 1838505 | |
| north | 1713311 | |
| ocean | 692905 | 8.7% |
| 659976 | 8.3% | |
| neotropics | 659180 | 8.3% |
| south | 633934 | 7.9% |
| atlantic | 361973 | 4.5% |
| pacific | 344558 | 4.3% |
| africa | 139956 | 1.8% |
| asia-tropical | 124686 | 1.6% |
| Other values (29) | 806425 |
Most occurring characters
| Value | Count | Frequency (%) |
| r | 4847298 | 9.1% |
| 4695637 | 8.8% | |
| c | 4576576 | 8.6% |
| i | 4342913 | 8.1% |
| a | 4262577 | 8.0% |
| t | 4129784 | 7.7% |
| o | 3913099 | 7.3% |
| e | 3909963 | 7.3% |
| A | 2706413 | 5.1% |
| N | 2372489 | 4.4% |
| Other values (33) | 13660649 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 40274803 | |
| Uppercase Letter | 7528783 | 14.1% |
| Space Separator | 4695637 | 8.8% |
| Dash Punctuation | 872720 | 1.6% |
| Other Punctuation | 45452 | 0.1% |
| Decimal Number | 3 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 4847298 | |
| c | 4576576 | |
| i | 4342913 | |
| a | 4262577 | |
| t | 4129784 | |
| o | 3913099 | |
| e | 3909963 | |
| h | 2348709 | |
| m | 1927373 | 4.8% |
| n | 1468719 | 3.6% |
| Other values (11) | 4547792 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 2706413 | |
| N | 2372489 | |
| O | 706244 | 9.4% |
| S | 635469 | 8.4% |
| P | 344622 | 4.6% |
| T | 213478 | 2.8% |
| I | 207748 | 2.8% |
| C | 112108 | 1.5% |
| W | 109235 | 1.5% |
| E | 107549 | 1.4% |
| Other values (3) | 13428 | 0.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 44654 | |
| / | 550 | 1.2% |
| ? | 247 | 0.5% |
| . | 1 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 1 | |
| 3 | 1 | |
| 0 | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 4695637 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 872720 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 47803586 | |
| Common | 5613812 | 10.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 4847298 | |
| c | 4576576 | |
| i | 4342913 | |
| a | 4262577 | |
| t | 4129784 | 8.6% |
| o | 3913099 | 8.2% |
| e | 3909963 | 8.2% |
| A | 2706413 | 5.7% |
| N | 2372489 | 5.0% |
| h | 2348709 | 4.9% |
| Other values (24) | 10393765 |
Common
| Value | Count | Frequency (%) |
| 4695637 | ||
| - | 872720 | 15.5% |
| , | 44654 | 0.8% |
| / | 550 | < 0.1% |
| ? | 247 | < 0.1% |
| 6 | 1 | < 0.1% |
| 3 | 1 | < 0.1% |
| . | 1 | < 0.1% |
| 0 | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 53417398 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| r | 4847298 | 9.1% |
| 4695637 | 8.8% | |
| c | 4576576 | 8.6% |
| i | 4342913 | 8.1% |
| a | 4262577 | 8.0% |
| t | 4129784 | 7.7% |
| o | 3913099 | 7.3% |
| e | 3909963 | 7.3% |
| A | 2706413 | 5.1% |
| N | 2372489 | 4.4% |
| Other values (33) | 13660649 |
waterBody
Text
Missing 
| Distinct | 2959 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 3107446 |
| Missing (%) | 81.5% |
| Memory size | 29.1 MiB |
Length
| Max length | 75 |
|---|---|
| Median length | 73 |
| Mean length | 24.15769409 |
| Min length | 4 |
Unique
| Unique | 1175 ? |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | North Atlantic Ocean, Caribbean Sea |
|---|---|
| 2nd row | North Atlantic Ocean, Gulf of Mexico |
| 3rd row | North Atlantic Ocean, Gulf of Mexico, Galveston Bay |
| 4th row | North Pacific Ocean, Gulf of California |
| 5th row | North Atlantic Ocean, Gulf of Guinea |
| Value | Count | Frequency (%) |
| ocean | 692937 | |
| north | 526630 | |
| atlantic | 362005 | |
| pacific | 281664 | |
| of | 114420 | 4.3% |
| sea | 113745 | 4.3% |
| gulf | 112638 | 4.2% |
| south | 98866 | 3.7% |
| mexico | 87826 | 3.3% |
| caribbean | 51340 | 1.9% |
| Other values (2054) | 223154 | 8.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1958572 | ||
| a | 1779883 | |
| c | 1769422 | |
| t | 1425954 | 8.4% |
| n | 1282457 | 7.5% |
| i | 1197252 | 7.0% |
| e | 1020751 | 6.0% |
| o | 873786 | 5.1% |
| O | 697406 | 4.1% |
| r | 658055 | 3.9% |
| Other values (64) | 4407569 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 12299427 | |
| Uppercase Letter | 2553914 | 15.0% |
| Space Separator | 1958572 | 11.5% |
| Other Punctuation | 258083 | 1.5% |
| Dash Punctuation | 849 | < 0.1% |
| Modifier Letter | 186 | < 0.1% |
| Open Punctuation | 38 | < 0.1% |
| Close Punctuation | 38 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1779883 | |
| c | 1769422 | |
| t | 1425954 | |
| n | 1282457 | |
| i | 1197252 | |
| e | 1020751 | |
| o | 873786 | |
| r | 658055 | 5.4% |
| h | 646399 | 5.3% |
| f | 515487 | 4.2% |
| Other values (23) | 1129981 |
Uppercase Letter
| Value | Count | Frequency (%) |
| O | 697406 | |
| N | 528030 | |
| A | 393486 | |
| P | 291650 | |
| S | 234718 | 9.2% |
| G | 115417 | 4.5% |
| M | 104639 | 4.1% |
| C | 74920 | 2.9% |
| B | 42905 | 1.7% |
| I | 37263 | 1.5% |
| Other values (16) | 33480 | 1.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 256772 | |
| ; | 447 | 0.2% |
| ' | 340 | 0.1% |
| . | 265 | 0.1% |
| / | 195 | 0.1% |
| ? | 36 | < 0.1% |
| : | 27 | < 0.1% |
| * | 1 | < 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 37 | |
| [ | 1 | 2.6% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 37 | |
| ] | 1 | 2.6% |
Space Separator
| Value | Count | Frequency (%) |
| 1958572 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 849 |
Modifier Letter
| Value | Count | Frequency (%) |
| ʻ | 186 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 14853341 | |
| Common | 2217766 | 13.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 1779883 | |
| c | 1769422 | |
| t | 1425954 | |
| n | 1282457 | 8.6% |
| i | 1197252 | 8.1% |
| e | 1020751 | 6.9% |
| o | 873786 | 5.9% |
| O | 697406 | 4.7% |
| r | 658055 | 4.4% |
| h | 646399 | 4.4% |
| Other values (49) | 3501976 |
Common
| Value | Count | Frequency (%) |
| 1958572 | ||
| , | 256772 | 11.6% |
| - | 849 | < 0.1% |
| ; | 447 | < 0.1% |
| ' | 340 | < 0.1% |
| . | 265 | < 0.1% |
| / | 195 | < 0.1% |
| ʻ | 186 | < 0.1% |
| ( | 37 | < 0.1% |
| ) | 37 | < 0.1% |
| Other values (5) | 66 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 17070508 | |
| None | 413 | < 0.1% |
| Modifier Letters | 186 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1958572 | ||
| a | 1779883 | |
| c | 1769422 | |
| t | 1425954 | 8.4% |
| n | 1282457 | 7.5% |
| i | 1197252 | 7.0% |
| e | 1020751 | 6.0% |
| o | 873786 | 5.1% |
| O | 697406 | 4.1% |
| r | 658055 | 3.9% |
| Other values (55) | 4406970 |
Modifier Letters
| Value | Count | Frequency (%) |
| ʻ | 186 |
None
| Value | Count | Frequency (%) |
| ā | 186 | |
| í | 87 | |
| á | 62 | 15.0% |
| ñ | 34 | 8.2% |
| é | 21 | 5.1% |
| ó | 15 | 3.6% |
| è | 6 | 1.5% |
| É | 2 | 0.5% |
islandGroup
Text
Missing 
| Distinct | 711 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 3729526 |
| Missing (%) | 97.8% |
| Memory size | 29.1 MiB |
Length
| Max length | 45 |
|---|---|
| Median length | 41 |
| Mean length | 14.65497263 |
| Min length | 4 |
Unique
| Unique | 142 ? |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | Pelican Cays |
|---|---|
| 2nd row | Greater Antilles |
| 3rd row | Stewart Islands |
| 4th row | Ralik Chain |
| 5th row | Virgin Islands |
| Value | Count | Frequency (%) |
| islands | 29499 | 16.0% |
| antilles | 14047 | 7.6% |
| greater | 13778 | 7.5% |
| group | 12534 | 6.8% |
| is | 8170 | 4.4% |
| leeward | 4503 | 2.4% |
| new | 3902 | 2.1% |
| hispaniola | 3745 | 2.0% |
| chain | 3337 | 1.8% |
| virgin | 2783 | 1.5% |
| Other values (590) | 87985 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 144757 | 11.7% |
| s | 111752 | 9.0% |
| 99710 | 8.0% | |
| n | 91421 | 7.4% |
| l | 89010 | 7.2% |
| e | 86875 | 7.0% |
| r | 74416 | 6.0% |
| i | 63025 | 5.1% |
| d | 51906 | 4.2% |
| t | 45327 | 3.7% |
| Other values (59) | 381216 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 944900 | |
| Uppercase Letter | 181881 | 14.7% |
| Space Separator | 99710 | 8.0% |
| Other Punctuation | 8968 | 0.7% |
| Open Punctuation | 1958 | 0.2% |
| Close Punctuation | 1958 | 0.2% |
| Dash Punctuation | 18 | < 0.1% |
| Format | 17 | < 0.1% |
| Math Symbol | 5 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 144757 | |
| s | 111752 | |
| n | 91421 | |
| l | 89010 | |
| e | 86875 | |
| r | 74416 | |
| i | 63025 | |
| d | 51906 | 5.5% |
| t | 45327 | 4.8% |
| o | 41461 | 4.4% |
| Other values (20) | 144950 |
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 40709 | |
| G | 32103 | |
| A | 17580 | |
| C | 13790 | 7.6% |
| V | 9621 | 5.3% |
| L | 9140 | 5.0% |
| S | 8922 | 4.9% |
| B | 6699 | 3.7% |
| N | 5730 | 3.2% |
| H | 5523 | 3.0% |
| Other values (17) | 32064 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 8168 | |
| ' | 784 | 8.7% |
| , | 13 | 0.1% |
| ? | 3 | < 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1133 | |
| [ | 825 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1133 | |
| ] | 825 |
Space Separator
| Value | Count | Frequency (%) |
| 99710 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 18 |
Format
| Value | Count | Frequency (%) |
| | 17 |
Math Symbol
| Value | Count | Frequency (%) |
| = | 5 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1126781 | |
| Common | 112634 | 9.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 144757 | |
| s | 111752 | 9.9% |
| n | 91421 | 8.1% |
| l | 89010 | 7.9% |
| e | 86875 | 7.7% |
| r | 74416 | 6.6% |
| i | 63025 | 5.6% |
| d | 51906 | 4.6% |
| t | 45327 | 4.0% |
| o | 41461 | 3.7% |
| Other values (47) | 326831 |
Common
| Value | Count | Frequency (%) |
| 99710 | ||
| . | 8168 | 7.3% |
| ( | 1133 | 1.0% |
| ) | 1133 | 1.0% |
| [ | 825 | 0.7% |
| ] | 825 | 0.7% |
| ' | 784 | 0.7% |
| - | 18 | < 0.1% |
| | 17 | < 0.1% |
| , | 13 | < 0.1% |
| Other values (2) | 8 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1235671 | |
| None | 3727 | 0.3% |
| Punctuation | 17 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 144757 | 11.7% |
| s | 111752 | 9.0% |
| 99710 | 8.1% | |
| n | 91421 | 7.4% |
| l | 89010 | 7.2% |
| e | 86875 | 7.0% |
| r | 74416 | 6.0% |
| i | 63025 | 5.1% |
| d | 51906 | 4.2% |
| t | 45327 | 3.7% |
| Other values (52) | 377472 |
None
| Value | Count | Frequency (%) |
| Î | 1933 | |
| á | 1755 | |
| Ō | 30 | 0.8% |
| ñ | 7 | 0.2% |
| ù | 1 | < 0.1% |
| à | 1 | < 0.1% |
Punctuation
| Value | Count | Frequency (%) |
| | 17 |
island
Text
Missing 
| Distinct | 4691 |
|---|---|
| Distinct (%) | 1.8% |
| Missing | 3560499 |
| Missing (%) | 93.4% |
| Memory size | 29.1 MiB |
Length
| Max length | 47 |
|---|---|
| Median length | 41 |
| Mean length | 9.538844637 |
| Min length | 2 |
Unique
| Unique | 1368 ? |
|---|---|
| Unique (%) | 0.5% |
Sample
| 1st row | Honshu |
|---|---|
| 2nd row | Lana'i |
| 3rd row | Cat Cay |
| 4th row | Hawaii |
| 5th row | Sumatra |
| Value | Count | Frequency (%) |
| island | 42237 | 10.8% |
| hispaniola | 20799 | 5.3% |
| cuba | 10640 | 2.7% |
| oahu | 9896 | 2.5% |
| atoll | 8952 | 2.3% |
| luzon | 8577 | 2.2% |
| new | 7682 | 2.0% |
| bermuda | 6749 | 1.7% |
| guinea | 6114 | 1.6% |
| st | 6066 | 1.6% |
| Other values (3576) | 261632 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 373003 | |
| n | 172836 | 7.1% |
| i | 161013 | 6.7% |
| o | 150399 | 6.2% |
| 135744 | 5.6% | |
| l | 133792 | 5.5% |
| e | 121984 | 5.0% |
| u | 119529 | 4.9% |
| s | 109905 | 4.5% |
| r | 96589 | 4.0% |
| Other values (77) | 844257 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1864763 | |
| Uppercase Letter | 382282 | 15.8% |
| Space Separator | 135744 | 5.6% |
| Other Punctuation | 18988 | 0.8% |
| Close Punctuation | 8062 | 0.3% |
| Open Punctuation | 8058 | 0.3% |
| Dash Punctuation | 1145 | < 0.1% |
| Decimal Number | 8 | < 0.1% |
| Modifier Letter | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 373003 | |
| n | 172836 | |
| i | 161013 | |
| o | 150399 | |
| l | 133792 | 7.2% |
| e | 121984 | 6.5% |
| u | 119529 | 6.4% |
| s | 109905 | 5.9% |
| r | 96589 | 5.2% |
| d | 86087 | 4.6% |
| Other values (33) | 339626 |
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 54105 | |
| C | 36416 | 9.5% |
| H | 35901 | 9.4% |
| S | 29181 | 7.6% |
| B | 28985 | 7.6% |
| M | 24615 | 6.4% |
| T | 17913 | 4.7% |
| A | 17328 | 4.5% |
| G | 17269 | 4.5% |
| L | 17032 | 4.5% |
| Other values (18) | 103537 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 9244 | |
| ' | 9091 | |
| , | 582 | 3.1% |
| ? | 56 | 0.3% |
| / | 15 | 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 3 | |
| 3 | 3 | |
| 2 | 1 | 12.5% |
| 6 | 1 | 12.5% |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 5868 | |
| ( | 2190 | 27.2% |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 5867 | |
| ) | 2195 | 27.2% |
Space Separator
| Value | Count | Frequency (%) |
| 135744 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1145 |
Modifier Letter
| Value | Count | Frequency (%) |
| ʻ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2247045 | |
| Common | 172006 | 7.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 373003 | |
| n | 172836 | 7.7% |
| i | 161013 | 7.2% |
| o | 150399 | 6.7% |
| l | 133792 | 6.0% |
| e | 121984 | 5.4% |
| u | 119529 | 5.3% |
| s | 109905 | 4.9% |
| r | 96589 | 4.3% |
| d | 86087 | 3.8% |
| Other values (61) | 721908 |
Common
| Value | Count | Frequency (%) |
| 135744 | ||
| . | 9244 | 5.4% |
| ' | 9091 | 5.3% |
| [ | 5868 | 3.4% |
| ] | 5867 | 3.4% |
| ) | 2195 | 1.3% |
| ( | 2190 | 1.3% |
| - | 1145 | 0.7% |
| , | 582 | 0.3% |
| ? | 56 | < 0.1% |
| Other values (6) | 24 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2416110 | |
| None | 2934 | 0.1% |
| Latin Ext Additional | 6 | < 0.1% |
| Modifier Letters | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 373003 | |
| n | 172836 | 7.2% |
| i | 161013 | 6.7% |
| o | 150399 | 6.2% |
| 135744 | 5.6% | |
| l | 133792 | 5.5% |
| e | 121984 | 5.0% |
| u | 119529 | 4.9% |
| s | 109905 | 4.5% |
| r | 96589 | 4.0% |
| Other values (56) | 841316 |
None
| Value | Count | Frequency (%) |
| ç | 739 | |
| Î | 657 | |
| é | 407 | |
| ó | 393 | |
| á | 298 | |
| â | 151 | 5.1% |
| ñ | 126 | 4.3% |
| ã | 67 | 2.3% |
| Ö | 26 | 0.9% |
| í | 20 | 0.7% |
| Other values (9) | 50 | 1.7% |
Latin Ext Additional
| Value | Count | Frequency (%) |
| ố | 6 |
Modifier Letters
| Value | Count | Frequency (%) |
| ʻ | 1 |
country
Text
Missing 
| Distinct | 802 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 160727 |
| Missing (%) | 4.2% |
| Memory size | 29.1 MiB |
Length
| Max length | 57 |
|---|---|
| Median length | 51 |
| Mean length | 10.00230992 |
| Min length | 1 |
Unique
| Unique | 181 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Belize |
|---|---|
| 2nd row | United States |
| 3rd row | United States |
| 4th row | United States |
| 5th row | United States |
| Value | Count | Frequency (%) |
| united | 1389854 | |
| states | 1377250 | |
| mexico | 187807 | 3.4% |
| brazil | 153884 | 2.8% |
| philippines | 110607 | 2.0% |
| colombia | 95091 | 1.7% |
| canada | 81065 | 1.5% |
| panama | 78531 | 1.4% |
| venezuela | 70949 | 1.3% |
| china | 65037 | 1.2% |
| Other values (523) | 1897614 |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 4574640 | |
| a | 4243119 | |
| e | 4040895 | |
| i | 3253675 | 8.9% |
| n | 2814016 | 7.7% |
| s | 1918948 | 5.3% |
| d | 1880183 | 5.1% |
| 1854317 | 5.1% | |
| S | 1517111 | 4.2% |
| U | 1433557 | 3.9% |
| Other values (61) | 9011698 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 29173572 | |
| Uppercase Letter | 5475863 | 15.0% |
| Space Separator | 1854317 | 5.1% |
| Other Punctuation | 28773 | 0.1% |
| Open Punctuation | 3859 | < 0.1% |
| Close Punctuation | 3859 | < 0.1% |
| Dash Punctuation | 1916 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 4574640 | |
| a | 4243119 | |
| e | 4040895 | |
| i | 3253675 | |
| n | 2814016 | |
| s | 1918948 | |
| d | 1880183 | |
| o | 1001931 | 3.4% |
| l | 897555 | 3.1% |
| r | 829433 | 2.8% |
| Other values (22) | 3719177 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 1517111 | |
| U | 1433557 | |
| C | 382045 | 7.0% |
| P | 357091 | 6.5% |
| M | 278504 | 5.1% |
| B | 256862 | 4.7% |
| I | 148509 | 2.7% |
| G | 148370 | 2.7% |
| A | 148237 | 2.7% |
| R | 112346 | 2.1% |
| Other values (15) | 693231 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 14847 | |
| , | 10327 | |
| ' | 1418 | 4.9% |
| / | 1333 | 4.6% |
| ? | 835 | 2.9% |
| : | 11 | < 0.1% |
| * | 1 | < 0.1% |
| ; | 1 | < 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 2919 | |
| ( | 940 | 24.4% |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 2919 | |
| ) | 940 | 24.4% |
Space Separator
| Value | Count | Frequency (%) |
| 1854317 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1916 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 34649435 | |
| Common | 1892724 | 5.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 4574640 | |
| a | 4243119 | |
| e | 4040895 | |
| i | 3253675 | |
| n | 2814016 | 8.1% |
| s | 1918948 | 5.5% |
| d | 1880183 | 5.4% |
| S | 1517111 | 4.4% |
| U | 1433557 | 4.1% |
| o | 1001931 | 2.9% |
| Other values (47) | 7971360 |
Common
| Value | Count | Frequency (%) |
| 1854317 | ||
| . | 14847 | 0.8% |
| , | 10327 | 0.5% |
| [ | 2919 | 0.2% |
| ] | 2919 | 0.2% |
| - | 1916 | 0.1% |
| ' | 1418 | 0.1% |
| / | 1333 | 0.1% |
| ( | 940 | < 0.1% |
| ) | 940 | < 0.1% |
| Other values (4) | 848 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 36538579 | |
| None | 3580 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 4574640 | |
| a | 4243119 | |
| e | 4040895 | |
| i | 3253675 | 8.9% |
| n | 2814016 | 7.7% |
| s | 1918948 | 5.3% |
| d | 1880183 | 5.1% |
| 1854317 | 5.1% | |
| S | 1517111 | 4.2% |
| U | 1433557 | 3.9% |
| Other values (55) | 9008118 |
None
| Value | Count | Frequency (%) |
| é | 1652 | |
| ç | 1023 | |
| ã | 403 | 11.3% |
| í | 374 | 10.4% |
| á | 100 | 2.8% |
| ô | 28 | 0.8% |
stateProvince
Text
Missing 
| Distinct | 7976 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 1028496 |
| Missing (%) | 27.0% |
| Memory size | 29.1 MiB |
Length
| Max length | 69 |
|---|---|
| Median length | 52 |
| Mean length | 9.274103668 |
| Min length | 1 |
Unique
| Unique | 1947 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Tennessee |
|---|---|
| 2nd row | West Virginia |
| 3rd row | Georgia |
| 4th row | Maine |
| 5th row | Texas |
| Value | Count | Frequency (%) |
| california | 149080 | 4.0% |
| florida | 127806 | 3.5% |
| virginia | 102361 | 2.8% |
| new | 80179 | 2.2% |
| carolina | 80093 | 2.2% |
| north | 67306 | 1.8% |
| texas | 65503 | 1.8% |
| alaska | 63769 | 1.7% |
| massachusetts | 58634 | 1.6% |
| maryland | 49985 | 1.4% |
| Other values (5671) | 2857870 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 3896884 | |
| i | 2209345 | 8.6% |
| n | 1894407 | 7.3% |
| o | 1887302 | 7.3% |
| r | 1678217 | 6.5% |
| e | 1361541 | 5.3% |
| s | 1218320 | 4.7% |
| l | 1101834 | 4.3% |
| t | 976954 | 3.8% |
| 916983 | 3.5% | |
| Other values (148) | 8692184 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 21070008 | |
| Uppercase Letter | 3685221 | 14.3% |
| Space Separator | 916983 | 3.5% |
| Dash Punctuation | 71055 | 0.3% |
| Other Punctuation | 47247 | 0.2% |
| Open Punctuation | 21634 | 0.1% |
| Close Punctuation | 21629 | 0.1% |
| Math Symbol | 133 | < 0.1% |
| Decimal Number | 32 | < 0.1% |
| Modifier Letter | 27 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 3896884 | |
| i | 2209345 | |
| n | 1894407 | |
| o | 1887302 | |
| r | 1678217 | |
| e | 1361541 | 6.5% |
| s | 1218320 | 5.8% |
| l | 1101834 | 5.2% |
| t | 976954 | 4.6% |
| u | 771612 | 3.7% |
| Other values (75) | 4073592 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 530171 | |
| M | 367758 | 10.0% |
| N | 283252 | 7.7% |
| S | 281251 | 7.6% |
| A | 269736 | 7.3% |
| P | 209980 | 5.7% |
| T | 183009 | 5.0% |
| V | 162815 | 4.4% |
| F | 155248 | 4.2% |
| B | 127439 | 3.5% |
| Other values (34) | 1114562 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 31198 | |
| / | 6290 | 13.3% |
| ' | 4974 | 10.5% |
| , | 3610 | 7.6% |
| ? | 1123 | 2.4% |
| & | 47 | 0.1% |
| * | 3 | < 0.1% |
| : | 1 | < 0.1% |
| ¡ | 1 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 19 | |
| 4 | 5 | 15.6% |
| 8 | 2 | 6.2% |
| 2 | 2 | 6.2% |
| 9 | 2 | 6.2% |
| 6 | 1 | 3.1% |
| 7 | 1 | 3.1% |
Math Symbol
| Value | Count | Frequency (%) |
| = | 126 | |
| + | 6 | 4.5% |
| | | 1 | 0.8% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 71037 | |
| – | 18 | < 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 12376 | |
| ( | 9258 |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 12373 | |
| ) | 9256 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ´ | 1 | |
| ¸ | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 916983 |
Modifier Letter
| Value | Count | Frequency (%) |
| ʼ | 27 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 24755229 | |
| Common | 1078742 | 4.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 3896884 | |
| i | 2209345 | 8.9% |
| n | 1894407 | 7.7% |
| o | 1887302 | 7.6% |
| r | 1678217 | 6.8% |
| e | 1361541 | 5.5% |
| s | 1218320 | 4.9% |
| l | 1101834 | 4.5% |
| t | 976954 | 3.9% |
| u | 771612 | 3.1% |
| Other values (119) | 7758813 |
Common
| Value | Count | Frequency (%) |
| 916983 | ||
| - | 71037 | 6.6% |
| . | 31198 | 2.9% |
| [ | 12376 | 1.1% |
| ] | 12373 | 1.1% |
| ( | 9258 | 0.9% |
| ) | 9256 | 0.9% |
| / | 6290 | 0.6% |
| ' | 4974 | 0.5% |
| , | 3610 | 0.3% |
| Other values (19) | 1387 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 25662145 | |
| None | 171747 | 0.7% |
| Latin Ext Additional | 34 | < 0.1% |
| Modifier Letters | 27 | < 0.1% |
| Punctuation | 18 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 3896884 | |
| i | 2209345 | 8.6% |
| n | 1894407 | 7.4% |
| o | 1887302 | 7.4% |
| r | 1678217 | 6.5% |
| e | 1361541 | 5.3% |
| s | 1218320 | 4.7% |
| l | 1101834 | 4.3% |
| t | 976954 | 3.8% |
| 916983 | 3.6% | |
| Other values (66) | 8520358 |
None
| Value | Count | Frequency (%) |
| á | 60711 | |
| í | 35016 | |
| é | 28069 | |
| ó | 20470 | 11.9% |
| ã | 10442 | 6.1% |
| ô | 5625 | 3.3% |
| ñ | 2577 | 1.5% |
| ü | 2189 | 1.3% |
| ä | 1170 | 0.7% |
| å | 914 | 0.5% |
| Other values (58) | 4564 | 2.7% |
Modifier Letters
| Value | Count | Frequency (%) |
| ʼ | 27 |
Punctuation
| Value | Count | Frequency (%) |
| – | 18 |
Latin Ext Additional
| Value | Count | Frequency (%) |
| ị | 10 | |
| ḍ | 5 | |
| ậ | 3 | 8.8% |
| ừ | 3 | 8.8% |
| ế | 3 | 8.8% |
| ṭ | 3 | 8.8% |
| ộ | 2 | 5.9% |
| ẵ | 1 | 2.9% |
| ḑ | 1 | 2.9% |
| ả | 1 | 2.9% |
| Other values (2) | 2 | 5.9% |
county
Text
Missing 
| Distinct | 15792 |
|---|---|
| Distinct (%) | 1.8% |
| Missing | 2948235 |
| Missing (%) | 77.3% |
| Memory size | 29.1 MiB |
Length
| Max length | 56 |
|---|---|
| Median length | 46 |
| Mean length | 10.23553814 |
| Min length | 1 |
Unique
| Unique | 4648 ? |
|---|---|
| Unique (%) | 0.5% |
Sample
| 1st row | Randolph |
|---|---|
| 2nd row | Decatur County |
| 3rd row | Penobscot |
| 4th row | Galveston County |
| 5th row | Dona Ana |
| Value | Count | Frequency (%) |
| county | 144252 | 10.8% |
| not | 54273 | 4.1% |
| stated | 54273 | 4.1% |
| san | 21268 | 1.6% |
| prince | 14567 | 1.1% |
| montgomery | 13322 | 1.0% |
| district | 13084 | 1.0% |
| santa | 11912 | 0.9% |
| honolulu | 11830 | 0.9% |
| 11178 | 0.8% | |
| Other values (11130) | 986539 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 840435 | 9.5% |
| o | 723050 | 8.2% |
| n | 688955 | 7.8% |
| e | 672877 | 7.6% |
| t | 608721 | 6.9% |
| r | 479065 | 5.4% |
| 470634 | 5.3% | |
| i | 444139 | 5.0% |
| u | 379380 | 4.3% |
| l | 341812 | 3.9% |
| Other values (124) | 3213516 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 6918803 | |
| Uppercase Letter | 1317635 | 14.9% |
| Space Separator | 470634 | 5.3% |
| Open Punctuation | 58067 | 0.7% |
| Close Punctuation | 58046 | 0.7% |
| Other Punctuation | 21597 | 0.2% |
| Dash Punctuation | 17642 | 0.2% |
| Decimal Number | 68 | < 0.1% |
| Modifier Symbol | 51 | < 0.1% |
| Math Symbol | 32 | < 0.1% |
| Other values (2) | 9 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 840435 | |
| o | 723050 | |
| n | 688955 | |
| e | 672877 | |
| t | 608721 | |
| r | 479065 | 6.9% |
| i | 444139 | 6.4% |
| u | 379380 | 5.5% |
| l | 341812 | 4.9% |
| s | 302994 | 4.4% |
| Other values (54) | 1437375 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 256790 | |
| S | 157540 | |
| M | 106824 | 8.1% |
| N | 84019 | 6.4% |
| B | 77068 | 5.8% |
| P | 76996 | 5.8% |
| A | 63198 | 4.8% |
| L | 53016 | 4.0% |
| H | 52717 | 4.0% |
| D | 51718 | 3.9% |
| Other values (32) | 337749 |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 11576 | |
| . | 6965 | |
| / | 2212 | 10.2% |
| ? | 444 | 2.1% |
| , | 323 | 1.5% |
| * | 41 | 0.2% |
| & | 27 | 0.1% |
| ; | 4 | < 0.1% |
| : | 2 | < 0.1% |
| ¡ | 2 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 29 | |
| 2 | 24 | |
| 0 | 13 | |
| 4 | 2 | 2.9% |
Math Symbol
| Value | Count | Frequency (%) |
| = | 30 | |
| + | 1 | 3.1% |
| ~ | 1 | 3.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 54306 | |
| ( | 3761 | 6.5% |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 54286 | |
| ) | 3760 | 6.5% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 17439 | |
| – | 203 | 1.2% |
Space Separator
| Value | Count | Frequency (%) |
| 470634 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ´ | 51 |
Modifier Letter
| Value | Count | Frequency (%) |
| ʻ | 7 |
Nonspacing Mark
| Value | Count | Frequency (%) |
| ́ | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8236438 | |
| Common | 626144 | 7.1% |
| Inherited | 2 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 840435 | 10.2% |
| o | 723050 | 8.8% |
| n | 688955 | 8.4% |
| e | 672877 | 8.2% |
| t | 608721 | 7.4% |
| r | 479065 | 5.8% |
| i | 444139 | 5.4% |
| u | 379380 | 4.6% |
| l | 341812 | 4.1% |
| s | 302994 | 3.7% |
| Other values (96) | 2755010 |
Common
| Value | Count | Frequency (%) |
| 470634 | ||
| [ | 54306 | 8.7% |
| ] | 54286 | 8.7% |
| - | 17439 | 2.8% |
| ' | 11576 | 1.8% |
| . | 6965 | 1.1% |
| ( | 3761 | 0.6% |
| ) | 3760 | 0.6% |
| / | 2212 | 0.4% |
| ? | 444 | 0.1% |
| Other values (17) | 761 | 0.1% |
Inherited
| Value | Count | Frequency (%) |
| ́ | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8833904 | |
| None | 28461 | 0.3% |
| Punctuation | 203 | < 0.1% |
| Modifier Letters | 7 | < 0.1% |
| Latin Ext Additional | 7 | < 0.1% |
| Diacriticals | 2 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 840435 | 9.5% |
| o | 723050 | 8.2% |
| n | 688955 | 7.8% |
| e | 672877 | 7.6% |
| t | 608721 | 6.9% |
| r | 479065 | 5.4% |
| 470634 | 5.3% | |
| i | 444139 | 5.0% |
| u | 379380 | 4.3% |
| l | 341812 | 3.9% |
| Other values (65) | 3184836 |
None
| Value | Count | Frequency (%) |
| á | 6113 | |
| é | 5084 | |
| í | 4771 | |
| ó | 4193 | |
| ã | 2909 | |
| ç | 1848 | 6.5% |
| è | 601 | 2.1% |
| ô | 591 | 2.1% |
| ñ | 496 | 1.7% |
| ü | 481 | 1.7% |
| Other values (41) | 1374 | 4.8% |
Punctuation
| Value | Count | Frequency (%) |
| – | 203 |
Modifier Letters
| Value | Count | Frequency (%) |
| ʻ | 7 |
Latin Ext Additional
| Value | Count | Frequency (%) |
| ộ | 3 | |
| ắ | 1 | 14.3% |
| ầ | 1 | 14.3% |
| ế | 1 | 14.3% |
| ợ | 1 | 14.3% |
Diacriticals
| Value | Count | Frequency (%) |
| ́ | 2 |
municipality
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 3814098 |
| Missing (%) | > 99.9% |
| Memory size | 29.1 MiB |
Length
| Max length | 23 |
|---|---|
| Median length | 23 |
| Mean length | 23 |
| Min length | 23 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Degrees Minutes Seconds |
|---|
| Value | Count | Frequency (%) |
| degrees | 1 | |
| minutes | 1 | |
| seconds | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 5 | |
| s | 3 | |
| 2 | 8.7% | |
| n | 2 | 8.7% |
| D | 1 | 4.3% |
| g | 1 | 4.3% |
| r | 1 | 4.3% |
| M | 1 | 4.3% |
| i | 1 | 4.3% |
| u | 1 | 4.3% |
| Other values (5) | 5 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 18 | |
| Uppercase Letter | 3 | 13.0% |
| Space Separator | 2 | 8.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 5 | |
| s | 3 | |
| n | 2 | 11.1% |
| g | 1 | 5.6% |
| r | 1 | 5.6% |
| i | 1 | 5.6% |
| u | 1 | 5.6% |
| t | 1 | 5.6% |
| c | 1 | 5.6% |
| o | 1 | 5.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| D | 1 | |
| M | 1 | |
| S | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 21 | |
| Common | 2 | 8.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 5 | |
| s | 3 | |
| n | 2 | 9.5% |
| D | 1 | 4.8% |
| g | 1 | 4.8% |
| r | 1 | 4.8% |
| M | 1 | 4.8% |
| i | 1 | 4.8% |
| u | 1 | 4.8% |
| t | 1 | 4.8% |
| Other values (4) | 4 |
Common
| Value | Count | Frequency (%) |
| 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 23 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 5 | |
| s | 3 | |
| 2 | 8.7% | |
| n | 2 | 8.7% |
| D | 1 | 4.3% |
| g | 1 | 4.3% |
| r | 1 | 4.3% |
| M | 1 | 4.3% |
| i | 1 | 4.3% |
| u | 1 | 4.3% |
| Other values (5) | 5 |
locality
Text
Missing 
| Distinct | 1351410 |
|---|---|
| Distinct (%) | 41.3% |
| Missing | 544962 |
| Missing (%) | 14.3% |
| Memory size | 29.1 MiB |
Length
| Max length | 140152 |
|---|---|
| Median length | 426 |
| Mean length | 40.30902559 |
| Min length | 1 |
Unique
| Unique | 1062019 ? |
|---|---|
| Unique (%) | 32.5% |
Sample
| 1st row | Carrie Bow Cay, Spur And Groove Zone |
|---|---|
| 2nd row | Eastern edge of Nashville, Davidson County. |
| 3rd row | Monongahela National Forest, 1.2-1.4 mi (by road) W of Bear Heaven Campground, on road to Bickle Knob |
| 4th row | Hales Landing, Flint River about 7 miles below Bainbridge, basal Chattahoochee Formation, Oligocene, Vicksburgian |
| 5th row | Orono |
| Value | Count | Frequency (%) |
| of | 1091981 | 5.1% |
| de | 280973 | 1.3% |
| island | 276714 | 1.3% |
| km | 234570 | 1.1% |
| on | 205256 | 1.0% |
| near | 197075 | 0.9% |
| the | 184862 | 0.9% |
| road | 183771 | 0.9% |
| mi | 174123 | 0.8% |
| and | 171053 | 0.8% |
| Other values (427681) | 18346488 |
Most occurring characters
| Value | Count | Frequency (%) |
| 18057579 | 13.7% | |
| a | 12014815 | 9.1% |
| e | 8972168 | 6.8% |
| o | 8710903 | 6.6% |
| n | 7329088 | 5.6% |
| i | 6648936 | 5.0% |
| r | 6396546 | 4.9% |
| t | 5851047 | 4.4% |
| l | 4796222 | 3.6% |
| s | 4619692 | 3.5% |
| Other values (349) | 48378731 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 89732453 | |
| Space Separator | 18057579 | 13.7% |
| Uppercase Letter | 15046232 | 11.4% |
| Other Punctuation | 5929390 | 4.5% |
| Decimal Number | 1979217 | 1.5% |
| Open Punctuation | 302503 | 0.2% |
| Close Punctuation | 301254 | 0.2% |
| Dash Punctuation | 279930 | 0.2% |
| Control | 108177 | 0.1% |
| Math Symbol | 24799 | < 0.1% |
| Other values (11) | 14193 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 12014815 | |
| e | 8972168 | |
| o | 8710903 | |
| n | 7329088 | 8.2% |
| i | 6648936 | 7.4% |
| r | 6396546 | 7.1% |
| t | 5851047 | 6.5% |
| l | 4796222 | 5.3% |
| s | 4619692 | 5.1% |
| u | 3306189 | 3.7% |
| Other values (145) | 21086847 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 1539532 | 10.2% |
| C | 1531103 | 10.2% |
| M | 1059073 | 7.0% |
| P | 1021572 | 6.8% |
| R | 999458 | 6.6% |
| B | 928277 | 6.2% |
| N | 860328 | 5.7% |
| A | 723736 | 4.8% |
| L | 658662 | 4.4% |
| I | 650936 | 4.3% |
| Other values (75) | 5073555 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 2750592 | |
| . | 2643574 | |
| : | 201217 | 3.4% |
| ; | 131007 | 2.2% |
| ' | 94383 | 1.6% |
| " | 44710 | 0.8% |
| / | 32011 | 0.5% |
| & | 18568 | 0.3% |
| ? | 5972 | 0.1% |
| # | 5738 | 0.1% |
| Other values (9) | 1618 | < 0.1% |
Control
| Value | Count | Frequency (%) |
| 107476 | ||
| 570 | 0.5% | |
| 31 | < 0.1% | |
| | 25 | < 0.1% |
| 23 | < 0.1% | |
| | 19 | < 0.1% |
| | 15 | < 0.1% |
| | 11 | < 0.1% |
| | 4 | < 0.1% |
| | 2 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 396666 | |
| 2 | 278718 | |
| 0 | 272453 | |
| 5 | 230645 | |
| 3 | 198601 | |
| 4 | 158189 | 8.0% |
| 6 | 141329 | 7.1% |
| 7 | 108515 | 5.5% |
| 8 | 103465 | 5.2% |
| 9 | 90636 | 4.6% |
Math Symbol
| Value | Count | Frequency (%) |
| = | 16935 | |
| + | 3758 | 15.2% |
| ± | 2194 | 8.8% |
| ~ | 698 | 2.8% |
| > | 661 | 2.7% |
| < | 490 | 2.0% |
| | | 52 | 0.2% |
| → | 7 | < 0.1% |
| ∆ | 3 | < 0.1% |
| ↔ | 1 | < 0.1% |
Other Symbol
| Value | Count | Frequency (%) |
| ° | 3220 | |
| ├ | 9 | 0.3% |
| ░ | 4 | 0.1% |
| ┬ | 4 | 0.1% |
| © | 1 | < 0.1% |
| │ | 1 | < 0.1% |
| ▒ | 1 | < 0.1% |
| ¦ | 1 | < 0.1% |
| ╢ | 1 | < 0.1% |
Other Number
| Value | Count | Frequency (%) |
| ½ | 5249 | |
| ¼ | 2255 | |
| ¾ | 305 | 3.9% |
| ² | 35 | 0.4% |
| ⅓ | 25 | 0.3% |
| ³ | 3 | < 0.1% |
| ⅛ | 3 | < 0.1% |
| ⅜ | 2 | < 0.1% |
Format
| Value | Count | Frequency (%) |
| | 60 | |
| | 2 | 2.8% |
| | 2 | 2.8% |
| | 2 | 2.8% |
| | 2 | 2.8% |
| | 1 | 1.4% |
| | 1 | 1.4% |
| | 1 | 1.4% |
Nonspacing Mark
| Value | Count | Frequency (%) |
| ̩ | 2 | |
| ̈ | 2 | |
| ̄ | 2 | |
| ̌ | 1 | |
| ᷉ | 1 | |
| ́ | 1 | |
| ͤ | 1 |
Modifier Letter
| Value | Count | Frequency (%) |
| ʻ | 187 | |
| ᵉ | 3 | 1.5% |
| ᴸ | 1 | 0.5% |
| ᴱ | 1 | 0.5% |
| ᵍ | 1 | 0.5% |
| ᵈ | 1 | 0.5% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 227265 | |
| [ | 74898 | 24.8% |
| „ | 161 | 0.1% |
| { | 94 | < 0.1% |
| ‚ | 85 | < 0.1% |
Modifier Symbol
| Value | Count | Frequency (%) |
| ´ | 209 | |
| ¨ | 13 | 5.6% |
| ^ | 10 | 4.3% |
| ¯ | 1 | 0.4% |
| ˚ | 1 | 0.4% |
Currency Symbol
| Value | Count | Frequency (%) |
| ¢ | 58 | |
| ¤ | 32 | |
| $ | 7 | 6.7% |
| £ | 7 | 6.7% |
| ¥ | 1 | 1.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 279899 | |
| – | 22 | < 0.1% |
| — | 9 | < 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 226165 | |
| ] | 74983 | 24.9% |
| } | 106 | < 0.1% |
Final Punctuation
| Value | Count | Frequency (%) |
| » | 357 | |
| ” | 30 | 7.6% |
| › | 8 | 2.0% |
Initial Punctuation
| Value | Count | Frequency (%) |
| « | 349 | |
| “ | 65 | 15.7% |
| ‛ | 1 | 0.2% |
Other Letter
| Value | Count | Frequency (%) |
| º | 1340 | |
| ª | 34 | 2.5% |
Space Separator
| Value | Count | Frequency (%) |
| 18057579 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 276 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 104780003 | |
| Common | 26995650 | 20.5% |
| Greek | 62 | < 0.1% |
| Inherited | 11 | < 0.1% |
| Cyrillic | 1 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 12014815 | 11.5% |
| e | 8972168 | 8.6% |
| o | 8710903 | 8.3% |
| n | 7329088 | 7.0% |
| i | 6648936 | 6.3% |
| r | 6396546 | 6.1% |
| t | 5851047 | 5.6% |
| l | 4796222 | 4.6% |
| s | 4619692 | 4.4% |
| u | 3306189 | 3.2% |
| Other values (225) | 36134397 |
Common
| Value | Count | Frequency (%) |
| 18057579 | ||
| , | 2750592 | 10.2% |
| . | 2643574 | 9.8% |
| 1 | 396666 | 1.5% |
| - | 279899 | 1.0% |
| 2 | 278718 | 1.0% |
| 0 | 272453 | 1.0% |
| 5 | 230645 | 0.9% |
| ( | 227265 | 0.8% |
| ) | 226165 | 0.8% |
| Other values (94) | 1632094 | 6.0% |
Greek
| Value | Count | Frequency (%) |
| λ | 14 | |
| ν | 11 | |
| η | 7 | |
| Κ | 7 | |
| ή | 7 | |
| υ | 7 | |
| Π | 2 | 3.2% |
| ω | 2 | 3.2% |
| ρ | 2 | 3.2% |
| ά | 2 | 3.2% |
Inherited
| Value | Count | Frequency (%) |
| ̩ | 2 | |
| ̈ | 2 | |
| ̄ | 2 | |
| | 1 | |
| ̌ | 1 | |
| ᷉ | 1 | |
| ́ | 1 | |
| ͤ | 1 |
Cyrillic
| Value | Count | Frequency (%) |
| ӗ | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 131387252 | |
| None | 387747 | 0.3% |
| Punctuation | 440 | < 0.1% |
| Modifier Letters | 188 | < 0.1% |
| Number Forms | 30 | < 0.1% |
| Latin Ext Additional | 18 | < 0.1% |
| Box Drawing | 15 | < 0.1% |
| Diacriticals | 9 | < 0.1% |
| Arrows | 8 | < 0.1% |
| Phonetic Ext | 7 | < 0.1% |
| Other values (6) | 13 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 18057579 | 13.7% | |
| a | 12014815 | 9.1% |
| e | 8972168 | 6.8% |
| o | 8710903 | 6.6% |
| n | 7329088 | 5.6% |
| i | 6648936 | 5.1% |
| r | 6396546 | 4.9% |
| t | 5851047 | 4.5% |
| l | 4796222 | 3.7% |
| s | 4619692 | 3.5% |
| Other values (90) | 47990256 |
None
| Value | Count | Frequency (%) |
| í | 96918 | |
| á | 69327 | |
| é | 46538 | |
| ó | 38861 | |
| ñ | 19435 | 5.0% |
| ã | 15351 | 4.0% |
| ú | 10979 | 2.8% |
| ç | 9491 | 2.4% |
| ü | 7701 | 2.0% |
| ä | 7079 | 1.8% |
| Other values (192) | 66067 |
Modifier Letters
| Value | Count | Frequency (%) |
| ʻ | 187 | |
| ˚ | 1 | 0.5% |
Punctuation
| Value | Count | Frequency (%) |
| „ | 161 | |
| ‚ | 85 | |
| “ | 65 | |
| … | 49 | 11.1% |
| ” | 30 | 6.8% |
| – | 22 | 5.0% |
| — | 9 | 2.0% |
| › | 8 | 1.8% |
| | 2 | 0.5% |
| | 2 | 0.5% |
| Other values (6) | 7 | 1.6% |
Number Forms
| Value | Count | Frequency (%) |
| ⅓ | 25 | |
| ⅛ | 3 | 10.0% |
| ⅜ | 2 | 6.7% |
Box Drawing
| Value | Count | Frequency (%) |
| ├ | 9 | |
| ┬ | 4 | |
| │ | 1 | 6.7% |
| ╢ | 1 | 6.7% |
Arrows
| Value | Count | Frequency (%) |
| → | 7 | |
| ↔ | 1 | 12.5% |
Block Elements
| Value | Count | Frequency (%) |
| ░ | 4 | |
| ▒ | 1 | 20.0% |
Latin Ext Additional
| Value | Count | Frequency (%) |
| ắ | 3 | |
| ḿ | 3 | |
| ḗ | 2 | |
| ẑ | 2 | |
| ấ | 1 | 5.6% |
| ṁ | 1 | 5.6% |
| ộ | 1 | 5.6% |
| ṅ | 1 | 5.6% |
| ế | 1 | 5.6% |
| ạ | 1 | 5.6% |
| Other values (2) | 2 |
Phonetic Ext
| Value | Count | Frequency (%) |
| ᵉ | 3 | |
| ᴸ | 1 | 14.3% |
| ᴱ | 1 | 14.3% |
| ᵍ | 1 | 14.3% |
| ᵈ | 1 | 14.3% |
Math Operators
| Value | Count | Frequency (%) |
| ∆ | 3 |
Diacriticals
| Value | Count | Frequency (%) |
| ̩ | 2 | |
| ̈ | 2 | |
| ̄ | 2 | |
| ̌ | 1 | |
| ́ | 1 | |
| ͤ | 1 |
IPA Ext
| Value | Count | Frequency (%) |
| ɶ | 2 |
Diacriticals Sup
| Value | Count | Frequency (%) |
| ᷉ | 1 |
Cyrillic
| Value | Count | Frequency (%) |
| ӗ | 1 |
Greek Ext
| Value | Count | Frequency (%) |
| ᾰ | 1 |
verbatimLocality
Text
Missing 
| Distinct | 3 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 3814096 |
| Missing (%) | > 99.9% |
| Memory size | 29.1 MiB |
Length
| Max length | 12 |
|---|---|
| Median length | 12 |
| Mean length | 9 |
| Min length | 3 |
Unique
| Unique | 3 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 78 50' 50" W |
|---|---|
| 2nd row | 4.6 |
| 3rd row | 79 51'48.5"W |
| Value | Count | Frequency (%) |
| 50 | 2 | |
| 78 | 1 | |
| w | 1 | |
| 4.6 | 1 | |
| 79 | 1 | |
| 51'48.5"w | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 4 | ||
| 5 | 4 | |
| 7 | 2 | |
| 8 | 2 | |
| 0 | 2 | |
| ' | 2 | |
| " | 2 | |
| W | 2 | |
| 4 | 2 | |
| . | 2 | |
| Other values (3) | 3 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 15 | |
| Other Punctuation | 6 | 22.2% |
| Space Separator | 4 | 14.8% |
| Uppercase Letter | 2 | 7.4% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 5 | 4 | |
| 7 | 2 | |
| 8 | 2 | |
| 0 | 2 | |
| 4 | 2 | |
| 6 | 1 | 6.7% |
| 9 | 1 | 6.7% |
| 1 | 1 | 6.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 2 | |
| " | 2 | |
| . | 2 |
Space Separator
| Value | Count | Frequency (%) |
| 4 |
Uppercase Letter
| Value | Count | Frequency (%) |
| W | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 25 | |
| Latin | 2 | 7.4% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 4 | ||
| 5 | 4 | |
| 7 | 2 | |
| 8 | 2 | |
| 0 | 2 | |
| ' | 2 | |
| " | 2 | |
| 4 | 2 | |
| . | 2 | |
| 6 | 1 | 4.0% |
| Other values (2) | 2 |
Latin
| Value | Count | Frequency (%) |
| W | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 27 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 4 | ||
| 5 | 4 | |
| 7 | 2 | |
| 8 | 2 | |
| 0 | 2 | |
| ' | 2 | |
| " | 2 | |
| W | 2 | |
| 4 | 2 | |
| . | 2 | |
| Other values (3) | 3 |
Missing 
| Distinct | 4481 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 2930460 |
| Missing (%) | 76.8% |
| Memory size | 29.1 MiB |
Length
| Max length | 23 |
|---|---|
| Median length | 9 |
| Mean length | 5.322696259 |
| Min length | 3 |
Unique
| Unique | 679 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | 1049.0 |
|---|---|
| 2nd row | 140.0 |
| 3rd row | 2880.0 |
| 4th row | 1219.0 |
| 5th row | 1100.0 |
| Value | Count | Frequency (%) |
| 1000.0 | 14755 | 1.7% |
| 100.0 | 14661 | 1.7% |
| 200.0 | 14161 | 1.6% |
| 300.0 | 11873 | 1.3% |
| 500.0 | 11745 | 1.3% |
| 1500.0 | 11723 | 1.3% |
| 0.0 | 10988 | 1.2% |
| 800.0 | 10953 | 1.2% |
| 900.0 | 10515 | 1.2% |
| 400.0 | 10358 | 1.2% |
| Other values (4452) | 761911 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1744284 | |
| . | 883637 | |
| 1 | 466563 | 9.9% |
| 2 | 333783 | 7.1% |
| 5 | 269825 | 5.7% |
| 3 | 228074 | 4.8% |
| 4 | 183692 | 3.9% |
| 6 | 159411 | 3.4% |
| 7 | 150513 | 3.2% |
| 8 | 147530 | 3.1% |
| Other values (17) | 136030 | 2.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3819512 | |
| Other Punctuation | 883637 | 18.8% |
| Dash Punctuation | 147 | < 0.1% |
| Lowercase Letter | 36 | < 0.1% |
| Uppercase Letter | 6 | < 0.1% |
| Space Separator | 4 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 10 | |
| s | 6 | |
| n | 4 | 11.1% |
| g | 2 | 5.6% |
| r | 2 | 5.6% |
| i | 2 | 5.6% |
| u | 2 | 5.6% |
| t | 2 | 5.6% |
| c | 2 | 5.6% |
| o | 2 | 5.6% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1744284 | |
| 1 | 466563 | 12.2% |
| 2 | 333783 | 8.7% |
| 5 | 269825 | 7.1% |
| 3 | 228074 | 6.0% |
| 4 | 183692 | 4.8% |
| 6 | 159411 | 4.2% |
| 7 | 150513 | 3.9% |
| 8 | 147530 | 3.9% |
| 9 | 135837 | 3.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| D | 2 | |
| M | 2 | |
| S | 2 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 883637 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 147 |
Space Separator
| Value | Count | Frequency (%) |
| 4 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4703300 | |
| Latin | 42 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 10 | |
| s | 6 | |
| n | 4 | 9.5% |
| D | 2 | 4.8% |
| g | 2 | 4.8% |
| r | 2 | 4.8% |
| M | 2 | 4.8% |
| i | 2 | 4.8% |
| u | 2 | 4.8% |
| t | 2 | 4.8% |
| Other values (4) | 8 |
Common
| Value | Count | Frequency (%) |
| 0 | 1744284 | |
| . | 883637 | |
| 1 | 466563 | 9.9% |
| 2 | 333783 | 7.1% |
| 5 | 269825 | 5.7% |
| 3 | 228074 | 4.8% |
| 4 | 183692 | 3.9% |
| 6 | 159411 | 3.4% |
| 7 | 150513 | 3.2% |
| 8 | 147530 | 3.1% |
| Other values (3) | 135988 | 2.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4703342 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 1744284 | |
| . | 883637 | |
| 1 | 466563 | 9.9% |
| 2 | 333783 | 7.1% |
| 5 | 269825 | 5.7% |
| 3 | 228074 | 4.8% |
| 4 | 183692 | 3.9% |
| 6 | 159411 | 3.4% |
| 7 | 150513 | 3.2% |
| 8 | 147530 | 3.1% |
| Other values (17) | 136030 | 2.9% |
Missing 
| Distinct | 2781 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 3486461 |
| Missing (%) | 91.4% |
| Memory size | 29.1 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 6 |
| Mean length | 5.334207265 |
| Min length | 3 |
Unique
| Unique | 622 ? |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | 1146.0 |
|---|---|
| 2nd row | 24.0 |
| 3rd row | 2000.0 |
| 4th row | 600.0 |
| 5th row | 700.0 |
| Value | Count | Frequency (%) |
| 1000.0 | 5570 | 1.7% |
| 1500.0 | 4985 | 1.5% |
| 600.0 | 4930 | 1.5% |
| 500.0 | 4852 | 1.5% |
| 200.0 | 4632 | 1.4% |
| 900.0 | 4315 | 1.3% |
| 1200.0 | 4276 | 1.3% |
| 100.0 | 4187 | 1.3% |
| 300.0 | 4027 | 1.2% |
| 400.0 | 3889 | 1.2% |
| Other values (2767) | 281975 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 651948 | |
| . | 327638 | |
| 1 | 178996 | 10.2% |
| 2 | 118924 | 6.8% |
| 5 | 96009 | 5.5% |
| 3 | 85495 | 4.9% |
| 4 | 68593 | 3.9% |
| 6 | 61434 | 3.5% |
| 7 | 55678 | 3.2% |
| 8 | 52906 | 3.0% |
| Other values (2) | 50068 | 2.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1420040 | |
| Other Punctuation | 327638 | 18.7% |
| Dash Punctuation | 11 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 651948 | |
| 1 | 178996 | 12.6% |
| 2 | 118924 | 8.4% |
| 5 | 96009 | 6.8% |
| 3 | 85495 | 6.0% |
| 4 | 68593 | 4.8% |
| 6 | 61434 | 4.3% |
| 7 | 55678 | 3.9% |
| 8 | 52906 | 3.7% |
| 9 | 50057 | 3.5% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 327638 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 11 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1747689 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 651948 | |
| . | 327638 | |
| 1 | 178996 | 10.2% |
| 2 | 118924 | 6.8% |
| 5 | 96009 | 5.5% |
| 3 | 85495 | 4.9% |
| 4 | 68593 | 3.9% |
| 6 | 61434 | 3.5% |
| 7 | 55678 | 3.2% |
| 8 | 52906 | 3.0% |
| Other values (2) | 50068 | 2.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1747689 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 651948 | |
| . | 327638 | |
| 1 | 178996 | 10.2% |
| 2 | 118924 | 6.8% |
| 5 | 96009 | 5.5% |
| 3 | 85495 | 4.9% |
| 4 | 68593 | 3.9% |
| 6 | 61434 | 3.5% |
| 7 | 55678 | 3.2% |
| 8 | 52906 | 3.0% |
| Other values (2) | 50068 | 2.9% |
Missing 
| Distinct | 3250 |
|---|---|
| Distinct (%) | 2.9% |
| Missing | 3703697 |
| Missing (%) | 97.1% |
| Memory size | 29.1 MiB |
Length
| Max length | 152 |
|---|---|
| Median length | 124 |
| Mean length | 7.486739371 |
| Min length | 1 |
Unique
| Unique | 807 ? |
|---|---|
| Unique (%) | 0.7% |
Sample
| 1st row | 3600 (3440-3760) ft |
|---|---|
| 2nd row | ~1800 ft. |
| 3rd row | 80 ft |
| 4th row | 160 m |
| 5th row | 150 m |
| Value | Count | Frequency (%) |
| ft | 79883 | |
| m | 25814 | 11.1% |
| ca | 5656 | 2.4% |
| feet | 1786 | 0.8% |
| 200 | 1755 | 0.8% |
| 1100-1350 | 1649 | 0.7% |
| 10 | 1423 | 0.6% |
| 20 | 1246 | 0.5% |
| 3500 | 1175 | 0.5% |
| 3400 | 1167 | 0.5% |
| Other values (2139) | 111547 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 164251 | |
| 122699 | ||
| t | 85121 | |
| f | 82821 | |
| 1 | 43159 | 5.2% |
| 3 | 41182 | 5.0% |
| 2 | 39236 | 4.7% |
| 4 | 35494 | 4.3% |
| 5 | 33415 | 4.0% |
| m | 27534 | 3.3% |
| Other values (69) | 151639 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 429592 | |
| Lowercase Letter | 249040 | |
| Space Separator | 122699 | 14.8% |
| Dash Punctuation | 12951 | 1.6% |
| Other Punctuation | 8620 | 1.0% |
| Uppercase Letter | 2190 | 0.3% |
| Open Punctuation | 636 | 0.1% |
| Close Punctuation | 636 | 0.1% |
| Math Symbol | 187 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 85121 | |
| f | 82821 | |
| m | 27534 | 11.1% |
| e | 12456 | 5.0% |
| a | 9784 | 3.9% |
| c | 6790 | 2.7% |
| s | 4113 | 1.7% |
| l | 3680 | 1.5% |
| o | 3258 | 1.3% |
| r | 2692 | 1.1% |
| Other values (15) | 10791 | 4.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| D | 611 | |
| T | 289 | |
| P | 240 | 11.0% |
| W | 220 | 10.0% |
| R | 188 | 8.6% |
| A | 168 | 7.7% |
| C | 93 | 4.2% |
| N | 61 | 2.8% |
| G | 52 | 2.4% |
| S | 37 | 1.7% |
| Other values (13) | 231 | 10.5% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 164251 | |
| 1 | 43159 | 10.0% |
| 3 | 41182 | 9.6% |
| 2 | 39236 | 9.1% |
| 4 | 35494 | 8.3% |
| 5 | 33415 | 7.8% |
| 6 | 25083 | 5.8% |
| 8 | 19351 | 4.5% |
| 7 | 16238 | 3.8% |
| 9 | 12183 | 2.8% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 7357 | |
| : | 599 | 6.9% |
| ' | 280 | 3.2% |
| , | 249 | 2.9% |
| " | 50 | 0.6% |
| ? | 45 | 0.5% |
| ; | 21 | 0.2% |
| / | 11 | 0.1% |
| & | 7 | 0.1% |
| ‡ | 1 | < 0.1% |
Math Symbol
| Value | Count | Frequency (%) |
| < | 96 | |
| + | 32 | 17.1% |
| = | 30 | 16.0% |
| > | 17 | 9.1% |
| ~ | 12 | 6.4% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 580 | |
| [ | 56 | 8.8% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 580 | |
| ] | 56 | 8.8% |
Space Separator
| Value | Count | Frequency (%) |
| 122699 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 12951 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 575321 | |
| Latin | 251230 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 85121 | |
| f | 82821 | |
| m | 27534 | 11.0% |
| e | 12456 | 5.0% |
| a | 9784 | 3.9% |
| c | 6790 | 2.7% |
| s | 4113 | 1.6% |
| l | 3680 | 1.5% |
| o | 3258 | 1.3% |
| r | 2692 | 1.1% |
| Other values (38) | 12981 | 5.2% |
Common
| Value | Count | Frequency (%) |
| 0 | 164251 | |
| 122699 | ||
| 1 | 43159 | 7.5% |
| 3 | 41182 | 7.2% |
| 2 | 39236 | 6.8% |
| 4 | 35494 | 6.2% |
| 5 | 33415 | 5.8% |
| 6 | 25083 | 4.4% |
| 8 | 19351 | 3.4% |
| 7 | 16238 | 2.8% |
| Other values (21) | 35213 | 6.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 826550 | |
| Punctuation | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 164251 | |
| 122699 | ||
| t | 85121 | |
| f | 82821 | |
| 1 | 43159 | 5.2% |
| 3 | 41182 | 5.0% |
| 2 | 39236 | 4.7% |
| 4 | 35494 | 4.3% |
| 5 | 33415 | 4.0% |
| m | 27534 | 3.3% |
| Other values (68) | 151638 |
Punctuation
| Value | Count | Frequency (%) |
| ‡ | 1 |
Missing 
| Distinct | 5449 |
|---|---|
| Distinct (%) | 1.3% |
| Missing | 3390497 |
| Missing (%) | 88.9% |
| Memory size | 29.1 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 4.186890052 |
| Min length | 3 |
Unique
| Unique | 1641 ? |
|---|---|
| Unique (%) | 0.4% |
Sample
| 1st row | 9.1 |
|---|---|
| 2nd row | 200.0 |
| 3rd row | 3200.0 |
| 4th row | 30.0 |
| 5th row | 844.0 |
| Value | Count | Frequency (%) |
| 0.0 | 50928 | 12.0% |
| 1.0 | 10648 | 2.5% |
| 3.0 | 9584 | 2.3% |
| 2.0 | 8926 | 2.1% |
| 15.0 | 7397 | 1.7% |
| 18.0 | 5876 | 1.4% |
| 9.0 | 5651 | 1.3% |
| 27.0 | 4728 | 1.1% |
| 37.0 | 4372 | 1.0% |
| 5.0 | 4195 | 1.0% |
| Other values (5437) | 311297 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 547302 | |
| . | 423602 | |
| 1 | 159203 | 9.0% |
| 2 | 119170 | 6.7% |
| 5 | 94176 | 5.3% |
| 3 | 91862 | 5.2% |
| 4 | 80810 | 4.6% |
| 8 | 70149 | 4.0% |
| 6 | 67703 | 3.8% |
| 7 | 61571 | 3.5% |
| Other values (2) | 58027 | 3.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1349952 | |
| Other Punctuation | 423602 | 23.9% |
| Dash Punctuation | 21 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 547302 | |
| 1 | 159203 | 11.8% |
| 2 | 119170 | 8.8% |
| 5 | 94176 | 7.0% |
| 3 | 91862 | 6.8% |
| 4 | 80810 | 6.0% |
| 8 | 70149 | 5.2% |
| 6 | 67703 | 5.0% |
| 7 | 61571 | 4.6% |
| 9 | 58006 | 4.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 423602 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 21 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1773575 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 547302 | |
| . | 423602 | |
| 1 | 159203 | 9.0% |
| 2 | 119170 | 6.7% |
| 5 | 94176 | 5.3% |
| 3 | 91862 | 5.2% |
| 4 | 80810 | 4.6% |
| 8 | 70149 | 4.0% |
| 6 | 67703 | 3.8% |
| 7 | 61571 | 3.5% |
| Other values (2) | 58027 | 3.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1773575 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 547302 | |
| . | 423602 | |
| 1 | 159203 | 9.0% |
| 2 | 119170 | 6.7% |
| 5 | 94176 | 5.3% |
| 3 | 91862 | 5.2% |
| 4 | 80810 | 4.6% |
| 8 | 70149 | 4.0% |
| 6 | 67703 | 3.8% |
| 7 | 61571 | 3.5% |
| Other values (2) | 58027 | 3.3% |
Missing 
| Distinct | 5288 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 3423246 |
| Missing (%) | 89.8% |
| Memory size | 29.1 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 4.294187329 |
| Min length | 3 |
Unique
| Unique | 1538 ? |
|---|---|
| Unique (%) | 0.4% |
Sample
| 1st row | 9.1 |
|---|---|
| 2nd row | 200.0 |
| 3rd row | 3200.0 |
| 4th row | 50.0 |
| 5th row | 804.0 |
| Value | Count | Frequency (%) |
| 1.0 | 18208 | 4.7% |
| 2.0 | 8861 | 2.3% |
| 3.0 | 8173 | 2.1% |
| 5.0 | 7349 | 1.9% |
| 9.0 | 5992 | 1.5% |
| 15.0 | 5745 | 1.5% |
| 18.0 | 5657 | 1.4% |
| 6.0 | 5297 | 1.4% |
| 27.0 | 5260 | 1.3% |
| 0.0 | 4935 | 1.3% |
| Other values (5276) | 315376 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 473419 | |
| . | 390853 | |
| 1 | 171730 | 10.2% |
| 2 | 121541 | 7.2% |
| 5 | 96119 | 5.7% |
| 3 | 89936 | 5.4% |
| 4 | 79728 | 4.8% |
| 8 | 68887 | 4.1% |
| 6 | 67913 | 4.0% |
| 7 | 61466 | 3.7% |
| Other values (2) | 56804 | 3.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1287522 | |
| Other Punctuation | 390853 | 23.3% |
| Dash Punctuation | 21 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 473419 | |
| 1 | 171730 | 13.3% |
| 2 | 121541 | 9.4% |
| 5 | 96119 | 7.5% |
| 3 | 89936 | 7.0% |
| 4 | 79728 | 6.2% |
| 8 | 68887 | 5.4% |
| 6 | 67913 | 5.3% |
| 7 | 61466 | 4.8% |
| 9 | 56783 | 4.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 390853 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 21 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1678396 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 473419 | |
| . | 390853 | |
| 1 | 171730 | 10.2% |
| 2 | 121541 | 7.2% |
| 5 | 96119 | 5.7% |
| 3 | 89936 | 5.4% |
| 4 | 79728 | 4.8% |
| 8 | 68887 | 4.1% |
| 6 | 67913 | 4.0% |
| 7 | 61466 | 3.7% |
| Other values (2) | 56804 | 3.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1678396 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 473419 | |
| . | 390853 | |
| 1 | 171730 | 10.2% |
| 2 | 121541 | 7.2% |
| 5 | 96119 | 5.7% |
| 3 | 89936 | 5.4% |
| 4 | 79728 | 4.8% |
| 8 | 68887 | 4.1% |
| 6 | 67913 | 4.0% |
| 7 | 61466 | 3.7% |
| Other values (2) | 56804 | 3.4% |
verbatimDepth
Text
Missing 
| Distinct | 1114 |
|---|---|
| Distinct (%) | 4.8% |
| Missing | 3790849 |
| Missing (%) | 99.4% |
| Memory size | 29.1 MiB |
Length
| Max length | 147466 |
|---|---|
| Median length | 91 |
| Mean length | 15.05470968 |
| Min length | 1 |
Unique
| Unique | 551 ? |
|---|---|
| Unique (%) | 2.4% |
Sample
| 1st row | Littoral |
|---|---|
| 2nd row | 00000000, 00000013 |
| 3rd row | penetration depth: 15cm |
| 4th row | 1 ms ca. |
| 5th row | Intertidal |
| Value | Count | Frequency (%) |
| ca | 10580 | |
| intertidal | 4974 | 10.3% |
| surface | 2615 | 5.4% |
| depths | 1198 | 2.5% |
| recorded | 1194 | 2.5% |
| multiple | 1187 | 2.5% |
| depth | 796 | 1.6% |
| shore | 499 | 1.0% |
| at | 486 | 1.0% |
| 0-300 | 481 | 1.0% |
| Other values (4738) | 24276 |
Most occurring characters
| Value | Count | Frequency (%) |
| 35104 | 10.0% | |
| a | 27031 | 7.7% |
| e | 23974 | 6.8% |
| t | 21796 | 6.2% |
| 18475 | 5.3% | |
| c | 16923 | 4.8% |
| r | 16167 | 4.6% |
| i | 14121 | 4.0% |
| d | 13514 | 3.9% |
| l | 12486 | 3.6% |
| Other values (90) | 150431 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 203573 | |
| Decimal Number | 42845 | 12.2% |
| Control | 35290 | 10.1% |
| Uppercase Letter | 22929 | 6.6% |
| Other Punctuation | 21619 | 6.2% |
| Space Separator | 18475 | 5.3% |
| Dash Punctuation | 4705 | 1.3% |
| Math Symbol | 221 | 0.1% |
| Open Punctuation | 184 | 0.1% |
| Close Punctuation | 180 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 27031 | |
| e | 23974 | |
| t | 21796 | |
| c | 16923 | |
| r | 16167 | |
| i | 14121 | 6.9% |
| d | 13514 | 6.6% |
| l | 12486 | 6.1% |
| n | 11247 | 5.5% |
| o | 8998 | 4.4% |
| Other values (28) | 37316 |
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 4794 | |
| S | 4128 | |
| A | 2642 | |
| M | 2208 | |
| C | 2204 | |
| N | 1087 | 4.7% |
| P | 851 | 3.7% |
| U | 672 | 2.9% |
| L | 573 | 2.5% |
| E | 490 | 2.1% |
| Other values (16) | 3280 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 11298 | |
| , | 4253 | 19.7% |
| : | 3616 | 16.7% |
| / | 1434 | 6.6% |
| " | 380 | 1.8% |
| ' | 255 | 1.2% |
| ; | 189 | 0.9% |
| ? | 114 | 0.5% |
| & | 58 | 0.3% |
| @ | 16 | 0.1% |
| Other values (3) | 6 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 12422 | |
| 1 | 5730 | |
| 2 | 4152 | 9.7% |
| 3 | 3884 | 9.1% |
| 5 | 3261 | 7.6% |
| 8 | 3129 | 7.3% |
| 6 | 2893 | 6.8% |
| 4 | 2705 | 6.3% |
| 7 | 2371 | 5.5% |
| 9 | 2298 | 5.4% |
Math Symbol
| Value | Count | Frequency (%) |
| = | 136 | |
| < | 60 | |
| + | 13 | 5.9% |
| ~ | 12 | 5.4% |
Control
| Value | Count | Frequency (%) |
| 35104 | ||
| 186 | 0.5% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 179 | |
| [ | 5 | 2.7% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 175 | |
| ] | 5 | 2.8% |
Space Separator
| Value | Count | Frequency (%) |
| 18475 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 4705 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 226502 | |
| Common | 123520 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 27031 | |
| e | 23974 | 10.6% |
| t | 21796 | 9.6% |
| c | 16923 | 7.5% |
| r | 16167 | 7.1% |
| i | 14121 | 6.2% |
| d | 13514 | 6.0% |
| l | 12486 | 5.5% |
| n | 11247 | 5.0% |
| o | 8998 | 4.0% |
| Other values (54) | 60245 |
Common
| Value | Count | Frequency (%) |
| 35104 | ||
| 18475 | ||
| 0 | 12422 | 10.1% |
| . | 11298 | 9.1% |
| 1 | 5730 | 4.6% |
| - | 4705 | 3.8% |
| , | 4253 | 3.4% |
| 2 | 4152 | 3.4% |
| 3 | 3884 | 3.1% |
| : | 3616 | 2.9% |
| Other values (26) | 19881 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 349964 | |
| None | 58 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 35104 | 10.0% | |
| a | 27031 | 7.7% |
| e | 23974 | 6.9% |
| t | 21796 | 6.2% |
| 18475 | 5.3% | |
| c | 16923 | 4.8% |
| r | 16167 | 4.6% |
| i | 14121 | 4.0% |
| d | 13514 | 3.9% |
| l | 12486 | 3.6% |
| Other values (78) | 150373 |
None
| Value | Count | Frequency (%) |
| í | 12 | |
| ó | 9 | |
| á | 8 | |
| ü | 7 | |
| é | 7 | |
| ô | 6 | |
| ñ | 2 | 3.4% |
| ä | 2 | 3.4% |
| ã | 2 | 3.4% |
| ø | 1 | 1.7% |
| Other values (2) | 2 | 3.4% |
locationRemarks
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 3814098 |
| Missing (%) | > 99.9% |
| Memory size | 29.1 MiB |
Length
| Max length | 16 |
|---|---|
| Median length | 16 |
| Mean length | 16 |
| Min length | 16 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | DeFilipps, R. A. |
|---|
| Value | Count | Frequency (%) |
| defilipps | 1 | |
| r | 1 | |
| a | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 2 | |
| p | 2 | |
| 2 | ||
| . | 2 | |
| D | 1 | |
| e | 1 | |
| F | 1 | |
| l | 1 | |
| s | 1 | |
| , | 1 | |
| Other values (2) | 2 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7 | |
| Uppercase Letter | 4 | |
| Other Punctuation | 3 | |
| Space Separator | 2 | 12.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 2 | |
| p | 2 | |
| e | 1 | |
| l | 1 | |
| s | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| D | 1 | |
| F | 1 | |
| R | 1 | |
| A | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 2 | |
| , | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 11 | |
| Common | 5 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 2 | |
| p | 2 | |
| D | 1 | |
| e | 1 | |
| F | 1 | |
| l | 1 | |
| s | 1 | |
| R | 1 | |
| A | 1 |
Common
| Value | Count | Frequency (%) |
| 2 | ||
| . | 2 | |
| , | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 16 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 2 | |
| p | 2 | |
| 2 | ||
| . | 2 | |
| D | 1 | |
| e | 1 | |
| F | 1 | |
| l | 1 | |
| s | 1 | |
| , | 1 | |
| Other values (2) | 2 |
decimalLatitude
Text
Missing 
| Distinct | 119398 |
|---|---|
| Distinct (%) | 10.4% |
| Missing | 2665103 |
| Missing (%) | 69.9% |
| Memory size | 29.1 MiB |
Length
| Max length | 11 |
|---|---|
| Median length | 10 |
| Mean length | 6.153433955 |
| Min length | 3 |
Unique
| Unique | 49829 ? |
|---|---|
| Unique (%) | 4.3% |
Sample
| 1st row | 16.8033 |
|---|---|
| 2nd row | 38.9361 |
| 3rd row | 29.2483 |
| 4th row | 44.8831 |
| 5th row | 29.2586 |
| Value | Count | Frequency (%) |
| 25.58 | 4259 | 0.4% |
| 40.6583 | 3632 | 0.3% |
| 26.17 | 3044 | 0.3% |
| 26.5 | 2214 | 0.2% |
| 39.6891 | 2124 | 0.2% |
| 38.9694 | 1853 | 0.2% |
| 39.6306 | 1749 | 0.2% |
| 38.895 | 1685 | 0.1% |
| 26.97 | 1656 | 0.1% |
| 60.75 | 1583 | 0.1% |
| Other values (110672) | 1125197 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 1148996 | |
| 3 | 918831 | |
| 2 | 629781 | |
| 1 | 613004 | |
| 5 | 564550 | |
| 8 | 556515 | |
| 7 | 544946 | |
| 4 | 530325 | |
| 6 | 524922 | |
| 9 | 445804 | 6.3% |
| Other values (3) | 592597 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 5698095 | |
| Other Punctuation | 1148996 | 16.3% |
| Dash Punctuation | 223153 | 3.2% |
| Uppercase Letter | 27 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 918831 | |
| 2 | 629781 | |
| 1 | 613004 | |
| 5 | 564550 | |
| 8 | 556515 | |
| 7 | 544946 | |
| 4 | 530325 | |
| 6 | 524922 | |
| 9 | 445804 | |
| 0 | 369417 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1148996 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 223153 |
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 27 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 7070244 | |
| Latin | 27 | < 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| . | 1148996 | |
| 3 | 918831 | |
| 2 | 629781 | |
| 1 | 613004 | |
| 5 | 564550 | |
| 8 | 556515 | |
| 7 | 544946 | |
| 4 | 530325 | |
| 6 | 524922 | |
| 9 | 445804 | 6.3% |
| Other values (2) | 592570 |
Latin
| Value | Count | Frequency (%) |
| E | 27 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7070271 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 1148996 | |
| 3 | 918831 | |
| 2 | 629781 | |
| 1 | 613004 | |
| 5 | 564550 | |
| 8 | 556515 | |
| 7 | 544946 | |
| 4 | 530325 | |
| 6 | 524922 | |
| 9 | 445804 | 6.3% |
| Other values (3) | 592597 |
decimalLongitude
Text
Missing 
| Distinct | 124298 |
|---|---|
| Distinct (%) | 10.8% |
| Missing | 2665103 |
| Missing (%) | 69.9% |
| Memory size | 29.1 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 9 |
| Mean length | 7.019817301 |
| Min length | 3 |
Unique
| Unique | 49181 ? |
|---|---|
| Unique (%) | 4.3% |
Sample
| 1st row | -88.0767 |
|---|---|
| 2nd row | -79.6908 |
| 3rd row | -88.1214 |
| 4th row | -68.672 |
| 5th row | -94.9533 |
| Value | Count | Frequency (%) |
| 80.1 | 4295 | 0.4% |
| 105.644 | 2150 | 0.2% |
| 127.848 | 1835 | 0.2% |
| 77.4714 | 1749 | 0.2% |
| 88.08 | 1737 | 0.2% |
| 67.7683 | 1710 | 0.1% |
| 77.0367 | 1651 | 0.1% |
| 139.5 | 1588 | 0.1% |
| 80.13 | 1583 | 0.1% |
| 77.1767 | 1529 | 0.1% |
| Other values (114404) | 1129169 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 1148996 | |
| - | 937099 | |
| 7 | 848574 | |
| 1 | 775389 | |
| 8 | 717466 | |
| 6 | 625080 | |
| 3 | 602433 | |
| 5 | 548904 | |
| 2 | 527202 | |
| 9 | 481545 | |
| Other values (2) | 853054 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 5979647 | |
| Other Punctuation | 1148996 | 14.2% |
| Dash Punctuation | 937099 | 11.6% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 7 | 848574 | |
| 1 | 775389 | |
| 8 | 717466 | |
| 6 | 625080 | |
| 3 | 602433 | |
| 5 | 548904 | |
| 2 | 527202 | |
| 9 | 481545 | |
| 4 | 436297 | |
| 0 | 416757 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1148996 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 937099 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 8065742 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| . | 1148996 | |
| - | 937099 | |
| 7 | 848574 | |
| 1 | 775389 | |
| 8 | 717466 | |
| 6 | 625080 | |
| 3 | 602433 | |
| 5 | 548904 | |
| 2 | 527202 | |
| 9 | 481545 | |
| Other values (2) | 853054 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8065742 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 1148996 | |
| - | 937099 | |
| 7 | 848574 | |
| 1 | 775389 | |
| 8 | 717466 | |
| 6 | 625080 | |
| 3 | 602433 | |
| 5 | 548904 | |
| 2 | 527202 | |
| 9 | 481545 | |
| Other values (2) | 853054 |
geodeticDatum
Text
Missing 
| Distinct | 32 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 3696977 |
| Missing (%) | 96.9% |
| Memory size | 29.1 MiB |
Length
| Max length | 28 |
|---|---|
| Median length | 5 |
| Mean length | 8.18266423 |
| Min length | 3 |
Unique
| Unique | 9 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | WGS84 |
|---|---|
| 2nd row | WGS84 |
| 3rd row | WGS84 |
| 4th row | NAD27 |
| 5th row | WGS84 |
| Value | Count | Frequency (%) |
| wgs84 | 65622 | |
| 84 | 25578 | 14.7% |
| wgs | 25577 | 14.7% |
| epsg:4326 | 25144 | 14.5% |
| nad27 | 13658 | 7.9% |
| nad83 | 3959 | 2.3% |
| prp_m | 3499 | 2.0% |
| not | 2459 | 1.4% |
| recorded | 2459 | 1.4% |
| agd66 | 947 | 0.5% |
| Other values (32) | 4709 | 2.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| G | 118860 | |
| 4 | 117158 | |
| S | 116960 | |
| 8 | 95317 | |
| W | 91309 | 9.5% |
| 56489 | 5.9% | |
| 2 | 40059 | 4.2% |
| P | 32649 | 3.4% |
| 3 | 29150 | 3.0% |
| 6 | 27541 | 2.9% |
| Other values (39) | 232878 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 451198 | |
| Decimal Number | 326062 | |
| Space Separator | 56489 | 5.9% |
| Lowercase Letter | 44172 | 4.6% |
| Close Punctuation | 25648 | 2.7% |
| Open Punctuation | 25648 | 2.7% |
| Other Punctuation | 25648 | 2.7% |
| Connector Punctuation | 3499 | 0.4% |
| Dash Punctuation | 6 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 9127 | |
| d | 6131 | |
| o | 5896 | |
| t | 4412 | |
| r | 4252 | |
| c | 3436 | 7.8% |
| n | 2902 | 6.6% |
| a | 2759 | 6.2% |
| u | 1214 | 2.7% |
| m | 977 | 2.2% |
| Other values (8) | 3066 | 6.9% |
Uppercase Letter
| Value | Count | Frequency (%) |
| G | 118860 | |
| S | 116960 | |
| W | 91309 | |
| P | 32649 | 7.2% |
| E | 25647 | 5.7% |
| D | 19535 | 4.3% |
| A | 18803 | 4.2% |
| N | 18412 | 4.1% |
| R | 4240 | 0.9% |
| M | 3499 | 0.8% |
| Other values (4) | 1284 | 0.3% |
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 117158 | |
| 8 | 95317 | |
| 2 | 40059 | 12.3% |
| 3 | 29150 | 8.9% |
| 6 | 27541 | 8.4% |
| 7 | 13763 | 4.2% |
| 0 | 2324 | 0.7% |
| 9 | 604 | 0.2% |
| 1 | 77 | < 0.1% |
| 5 | 69 | < 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 25647 | |
| / | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 56489 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 25648 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 25648 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 3499 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 6 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 495370 | |
| Common | 463000 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| G | 118860 | |
| S | 116960 | |
| W | 91309 | |
| P | 32649 | 6.6% |
| E | 25647 | 5.2% |
| D | 19535 | 3.9% |
| A | 18803 | 3.8% |
| N | 18412 | 3.7% |
| e | 9127 | 1.8% |
| d | 6131 | 1.2% |
| Other values (22) | 37937 | 7.7% |
Common
| Value | Count | Frequency (%) |
| 4 | 117158 | |
| 8 | 95317 | |
| 56489 | ||
| 2 | 40059 | 8.7% |
| 3 | 29150 | 6.3% |
| 6 | 27541 | 5.9% |
| ) | 25648 | 5.5% |
| ( | 25648 | 5.5% |
| : | 25647 | 5.5% |
| 7 | 13763 | 3.0% |
| Other values (7) | 6580 | 1.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 958370 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| G | 118860 | |
| 4 | 117158 | |
| S | 116960 | |
| 8 | 95317 | |
| W | 91309 | 9.5% |
| 56489 | 5.9% | |
| 2 | 40059 | 4.2% |
| P | 32649 | 3.4% |
| 3 | 29150 | 3.0% |
| 6 | 27541 | 2.9% |
| Other values (39) | 232878 |
coordinateUncertaintyInMeters
Text
Missing 
| Distinct | 6505 |
|---|---|
| Distinct (%) | 9.4% |
| Missing | 3744590 |
| Missing (%) | 98.2% |
| Memory size | 29.1 MiB |
Length
| Max length | 9 |
|---|---|
| Median length | 7 |
| Mean length | 5.591376656 |
| Min length | 1 |
Unique
| Unique | 2288 ? |
|---|---|
| Unique (%) | 3.3% |
Sample
| 1st row | 401.569 |
|---|---|
| 2nd row | 3246 |
| 3rd row | 3429.51 |
| 4th row | 801.569 |
| 5th row | 4233 |
| Value | Count | Frequency (%) |
| 3036 | 736 | 1.1% |
| 100 | 596 | 0.9% |
| 347.618 | 587 | 0.8% |
| 500 | 567 | 0.8% |
| 16000 | 557 | 0.8% |
| 186.684 | 539 | 0.8% |
| 1000 | 538 | 0.8% |
| 4615 | 493 | 0.7% |
| 1066 | 433 | 0.6% |
| 5615 | 430 | 0.6% |
| Other values (6495) | 64033 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 52237 | |
| 2 | 37995 | |
| 0 | 37635 | |
| 3 | 37119 | |
| . | 36994 | |
| 5 | 36439 | |
| 4 | 34540 | |
| 6 | 32987 | |
| 9 | 28494 | |
| 8 | 27891 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 351657 | |
| Other Punctuation | 36994 | 9.5% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 52237 | |
| 2 | 37995 | |
| 0 | 37635 | |
| 3 | 37119 | |
| 5 | 36439 | |
| 4 | 34540 | |
| 6 | 32987 | |
| 9 | 28494 | |
| 8 | 27891 | |
| 7 | 26320 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 36994 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 388651 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 52237 | |
| 2 | 37995 | |
| 0 | 37635 | |
| 3 | 37119 | |
| . | 36994 | |
| 5 | 36439 | |
| 4 | 34540 | |
| 6 | 32987 | |
| 9 | 28494 | |
| 8 | 27891 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 388651 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 52237 | |
| 2 | 37995 | |
| 0 | 37635 | |
| 3 | 37119 | |
| . | 36994 | |
| 5 | 36439 | |
| 4 | 34540 | |
| 6 | 32987 | |
| 9 | 28494 | |
| 8 | 27891 |
Missing 
| Distinct | 3 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 3814096 |
| Missing (%) | > 99.9% |
| Memory size | 29.1 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 2.666666667 |
| Min length | 2 |
Unique
| Unique | 3 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 10 |
|---|---|
| 2nd row | 153 |
| 3rd row | 239 |
| Value | Count | Frequency (%) |
| 10 | 1 | |
| 153 | 1 | |
| 239 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 2 | |
| 3 | 2 | |
| 0 | 1 | |
| 5 | 1 | |
| 2 | 1 | |
| 9 | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 8 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 2 | |
| 3 | 2 | |
| 0 | 1 | |
| 5 | 1 | |
| 2 | 1 | |
| 9 | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 8 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 2 | |
| 3 | 2 | |
| 0 | 1 | |
| 5 | 1 | |
| 2 | 1 | |
| 9 | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 2 | |
| 3 | 2 | |
| 0 | 1 | |
| 5 | 1 | |
| 2 | 1 | |
| 9 | 1 |
Missing 
| Distinct | 4 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 3814095 |
| Missing (%) | > 99.9% |
| Memory size | 29.1 MiB |
Length
| Max length | 15 |
|---|---|
| Median length | 9 |
| Mean length | 5.75 |
| Min length | 2 |
Unique
| Unique | 4 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 10 |
|---|---|
| 2nd row | 153 |
| 3rd row | 239 |
| 4th row | Fluminicola sp. |
| Value | Count | Frequency (%) |
| 10 | 1 | |
| 153 | 1 | |
| 239 | 1 | |
| fluminicola | 1 | |
| sp | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 2 | 8.7% |
| 3 | 2 | 8.7% |
| l | 2 | 8.7% |
| i | 2 | 8.7% |
| n | 1 | 4.3% |
| p | 1 | 4.3% |
| s | 1 | 4.3% |
| 1 | 4.3% | |
| a | 1 | 4.3% |
| o | 1 | 4.3% |
| Other values (9) | 9 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 12 | |
| Decimal Number | 8 | |
| Space Separator | 1 | 4.3% |
| Uppercase Letter | 1 | 4.3% |
| Other Punctuation | 1 | 4.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| l | 2 | |
| i | 2 | |
| n | 1 | |
| p | 1 | |
| s | 1 | |
| a | 1 | |
| o | 1 | |
| c | 1 | |
| m | 1 | |
| u | 1 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 2 | |
| 3 | 2 | |
| 0 | 1 | |
| 9 | 1 | |
| 2 | 1 | |
| 5 | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| F | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 13 | |
| Common | 10 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| l | 2 | |
| i | 2 | |
| n | 1 | |
| p | 1 | |
| s | 1 | |
| a | 1 | |
| o | 1 | |
| c | 1 | |
| m | 1 | |
| u | 1 |
Common
| Value | Count | Frequency (%) |
| 1 | 2 | |
| 3 | 2 | |
| 1 | ||
| 0 | 1 | |
| 9 | 1 | |
| 2 | 1 | |
| 5 | 1 | |
| . | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 23 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 2 | 8.7% |
| 3 | 2 | 8.7% |
| l | 2 | 8.7% |
| i | 2 | 8.7% |
| n | 1 | 4.3% |
| p | 1 | 4.3% |
| s | 1 | 4.3% |
| 1 | 4.3% | |
| a | 1 | 4.3% |
| o | 1 | 4.3% |
| Other values (9) | 9 |
Missing 
| Distinct | 6 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 3814093 |
| Missing (%) | > 99.9% |
| Memory size | 29.1 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Unique
| Unique | 6 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 1929 |
|---|---|
| 2nd row | 2003 |
| 3rd row | 1955 |
| 4th row | 1911 |
| 5th row | 1907 |
| Value | Count | Frequency (%) |
| 1929 | 1 | |
| 2003 | 1 | |
| 1955 | 1 | |
| 1911 | 1 | |
| 1907 | 1 | |
| 1876 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 7 | |
| 9 | 5 | |
| 0 | 3 | |
| 2 | 2 | 8.3% |
| 5 | 2 | 8.3% |
| 7 | 2 | 8.3% |
| 3 | 1 | 4.2% |
| 8 | 1 | 4.2% |
| 6 | 1 | 4.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 24 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 7 | |
| 9 | 5 | |
| 0 | 3 | |
| 2 | 2 | 8.3% |
| 5 | 2 | 8.3% |
| 7 | 2 | 8.3% |
| 3 | 1 | 4.2% |
| 8 | 1 | 4.2% |
| 6 | 1 | 4.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 24 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 7 | |
| 9 | 5 | |
| 0 | 3 | |
| 2 | 2 | 8.3% |
| 5 | 2 | 8.3% |
| 7 | 2 | 8.3% |
| 3 | 1 | 4.2% |
| 8 | 1 | 4.2% |
| 6 | 1 | 4.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 24 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 7 | |
| 9 | 5 | |
| 0 | 3 | |
| 2 | 2 | 8.3% |
| 5 | 2 | 8.3% |
| 7 | 2 | 8.3% |
| 3 | 1 | 4.2% |
| 8 | 1 | 4.2% |
| 6 | 1 | 4.2% |
verbatimLatitude
Text
Missing 
| Distinct | 44509 |
|---|---|
| Distinct (%) | 13.9% |
| Missing | 3492892 |
| Missing (%) | 91.6% |
| Memory size | 29.1 MiB |
Length
| Max length | 46331 |
|---|---|
| Median length | 10 |
| Mean length | 9.341891677 |
| Min length | 1 |
Unique
| Unique | 19728 ? |
|---|---|
| Unique (%) | 6.1% |
Sample
| 1st row | 38 56 10 N |
|---|---|
| 2nd row | 44.883125 |
| 3rd row | 02 47 -- N |
| 4th row | 37 58 10 N |
| 5th row | 03 18.20' N |
| Value | Count | Frequency (%) |
| n | 202633 | 21.3% |
| 60360 | 6.3% | |
| s | 37789 | 4.0% |
| 35 | 24680 | 2.6% |
| 38 | 19881 | 2.1% |
| 39 | 18621 | 2.0% |
| 37 | 17456 | 1.8% |
| 36 | 15457 | 1.6% |
| 10 | 11742 | 1.2% |
| 00 | 11158 | 1.2% |
| Other values (27212) | 532628 |
Most occurring characters
| Value | Count | Frequency (%) |
| 627624 | ||
| 3 | 283224 | |
| 0 | 248562 | 8.3% |
| N | 235749 | 7.9% |
| 2 | 217698 | 7.3% |
| 1 | 210460 | 7.0% |
| 5 | 188524 | 6.3% |
| 4 | 185672 | 6.2% |
| - | 142824 | 4.8% |
| 8 | 115599 | 3.9% |
| Other values (97) | 544745 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1759263 | |
| Space Separator | 627624 | 20.9% |
| Uppercase Letter | 293587 | 9.8% |
| Dash Punctuation | 142826 | 4.8% |
| Other Punctuation | 104838 | 3.5% |
| Lowercase Letter | 46817 | 1.6% |
| Control | 20038 | 0.7% |
| Other Symbol | 5019 | 0.2% |
| Other Letter | 237 | < 0.1% |
| Connector Punctuation | 82 | < 0.1% |
| Other values (7) | 350 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 6860 | |
| a | 4780 | |
| d | 4754 | |
| t | 3460 | 7.4% |
| i | 3459 | 7.4% |
| g | 3071 | 6.6% |
| o | 2771 | 5.9% |
| n | 2580 | 5.5% |
| r | 2476 | 5.3% |
| c | 2090 | 4.5% |
| Other values (24) | 10516 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 235749 | |
| S | 52717 | 18.0% |
| L | 617 | 0.2% |
| M | 576 | 0.2% |
| A | 514 | 0.2% |
| P | 453 | 0.2% |
| U | 361 | 0.1% |
| D | 341 | 0.1% |
| E | 281 | 0.1% |
| C | 278 | 0.1% |
| Other values (16) | 1700 | 0.6% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 78293 | |
| ' | 14754 | 14.1% |
| " | 4985 | 4.8% |
| ; | 3405 | 3.2% |
| : | 1208 | 1.2% |
| , | 885 | 0.8% |
| / | 821 | 0.8% |
| ′ | 177 | 0.2% |
| ? | 155 | 0.1% |
| * | 73 | 0.1% |
| Other values (5) | 82 | 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 283224 | |
| 0 | 248562 | |
| 2 | 217698 | |
| 1 | 210460 | |
| 5 | 188524 | |
| 4 | 185672 | |
| 8 | 115599 | |
| 9 | 108329 | 6.2% |
| 7 | 102597 | 5.8% |
| 6 | 98598 | 5.6% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 51 | |
| [ | 26 | |
| { | 1 | 1.3% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 50 | |
| ] | 26 | |
| } | 1 | 1.3% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 142824 | |
| – | 2 | < 0.1% |
Control
| Value | Count | Frequency (%) |
| 19932 | ||
| 106 | 0.5% |
Other Symbol
| Value | Count | Frequency (%) |
| ° | 5018 | |
| ◦ | 1 | < 0.1% |
Math Symbol
| Value | Count | Frequency (%) |
| = | 67 | |
| ~ | 5 | 6.9% |
Modifier Symbol
| Value | Count | Frequency (%) |
| ´ | 35 | |
| ˚ | 12 | 25.5% |
Space Separator
| Value | Count | Frequency (%) |
| 627624 |
Other Letter
| Value | Count | Frequency (%) |
| º | 237 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 82 |
Final Punctuation
| Value | Count | Frequency (%) |
| ” | 50 |
Modifier Letter
| Value | Count | Frequency (%) |
| ʹ | 25 |
Nonspacing Mark
| Value | Count | Frequency (%) |
| ̊ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2660039 | |
| Latin | 340641 | 11.4% |
| Inherited | 1 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 235749 | |
| S | 52717 | 15.5% |
| e | 6860 | 2.0% |
| a | 4780 | 1.4% |
| d | 4754 | 1.4% |
| t | 3460 | 1.0% |
| i | 3459 | 1.0% |
| g | 3071 | 0.9% |
| o | 2771 | 0.8% |
| n | 2580 | 0.8% |
| Other values (51) | 20440 | 6.0% |
Common
| Value | Count | Frequency (%) |
| 627624 | ||
| 3 | 283224 | |
| 0 | 248562 | 9.3% |
| 2 | 217698 | 8.2% |
| 1 | 210460 | 7.9% |
| 5 | 188524 | 7.1% |
| 4 | 185672 | 7.0% |
| - | 142824 | 5.4% |
| 8 | 115599 | 4.3% |
| 9 | 108329 | 4.1% |
| Other values (35) | 331523 |
Inherited
| Value | Count | Frequency (%) |
| ̊ | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2995041 | |
| None | 5318 | 0.2% |
| Punctuation | 283 | < 0.1% |
| Modifier Letters | 37 | < 0.1% |
| Diacriticals | 1 | < 0.1% |
| Geometric Shapes | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 627624 | ||
| 3 | 283224 | |
| 0 | 248562 | 8.3% |
| N | 235749 | 7.9% |
| 2 | 217698 | 7.3% |
| 1 | 210460 | 7.0% |
| 5 | 188524 | 6.3% |
| 4 | 185672 | 6.2% |
| - | 142824 | 4.8% |
| 8 | 115599 | 3.9% |
| Other values (78) | 539105 |
None
| Value | Count | Frequency (%) |
| ° | 5018 | |
| º | 237 | 4.5% |
| ´ | 35 | 0.7% |
| á | 6 | 0.1% |
| é | 6 | 0.1% |
| ô | 4 | 0.1% |
| í | 4 | 0.1% |
| ó | 3 | 0.1% |
| ü | 2 | < 0.1% |
| ç | 2 | < 0.1% |
Punctuation
| Value | Count | Frequency (%) |
| ′ | 177 | |
| ″ | 54 | 19.1% |
| ” | 50 | 17.7% |
| – | 2 | 0.7% |
Modifier Letters
| Value | Count | Frequency (%) |
| ʹ | 25 | |
| ˚ | 12 |
Diacriticals
| Value | Count | Frequency (%) |
| ̊ | 1 |
Geometric Shapes
| Value | Count | Frequency (%) |
| ◦ | 1 |
Missing 
| Distinct | 46810 |
|---|---|
| Distinct (%) | 14.6% |
| Missing | 3493424 |
| Missing (%) | 91.6% |
| Memory size | 29.1 MiB |
Length
| Max length | 68 |
|---|---|
| Median length | 11 |
| Mean length | 9.96988228 |
| Min length | 1 |
Unique
| Unique | 21397 ? |
|---|---|
| Unique (%) | 6.7% |
Sample
| 1st row | 079 41 27 W |
|---|---|
| 2nd row | -68.671977 |
| 3rd row | 016 25 -- E |
| 4th row | 076 55 55 W |
| 5th row | 59 39.00' W |
| Value | Count | Frequency (%) |
| w | 186725 | 19.8% |
| 60672 | 6.4% | |
| e | 53045 | 5.6% |
| 083 | 13556 | 1.4% |
| 30 | 9655 | 1.0% |
| 00 | 9463 | 1.0% |
| 077 | 9344 | 1.0% |
| 080 | 8478 | 0.9% |
| 081 | 8350 | 0.9% |
| 076 | 7865 | 0.8% |
| Other values (26677) | 577299 |
Most occurring characters
| Value | Count | Frequency (%) |
| 623777 | ||
| 0 | 408474 | |
| 1 | 243902 | 7.6% |
| W | 213465 | 6.7% |
| 3 | 189432 | 5.9% |
| 2 | 188925 | 5.9% |
| 5 | 186067 | 5.8% |
| 7 | 185084 | 5.8% |
| 8 | 180278 | 5.6% |
| 4 | 174743 | 5.5% |
| Other values (57) | 602945 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1995207 | |
| Space Separator | 623777 | 19.5% |
| Uppercase Letter | 287430 | 9.0% |
| Dash Punctuation | 171257 | 5.4% |
| Other Punctuation | 102184 | 3.2% |
| Lowercase Letter | 11674 | 0.4% |
| Other Symbol | 5009 | 0.2% |
| Other Letter | 232 | < 0.1% |
| Connector Punctuation | 82 | < 0.1% |
| Close Punctuation | 56 | < 0.1% |
| Other values (6) | 184 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| d | 2999 | |
| e | 2981 | |
| g | 2925 | |
| n | 552 | 4.7% |
| o | 520 | 4.5% |
| t | 402 | 3.4% |
| i | 392 | 3.4% |
| u | 380 | 3.3% |
| r | 293 | 2.5% |
| s | 130 | 1.1% |
| Other values (7) | 100 | 0.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 78769 | |
| ' | 14571 | 14.3% |
| " | 4974 | 4.9% |
| ; | 3378 | 3.3% |
| ′ | 177 | 0.2% |
| ? | 92 | 0.1% |
| * | 73 | 0.1% |
| ″ | 54 | 0.1% |
| : | 43 | < 0.1% |
| , | 36 | < 0.1% |
| Other values (2) | 17 | < 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| W | 213465 | |
| E | 72880 | 25.4% |
| L | 522 | 0.2% |
| D | 164 | 0.1% |
| S | 115 | < 0.1% |
| N | 111 | < 0.1% |
| G | 82 | < 0.1% |
| O | 61 | < 0.1% |
| M | 27 | < 0.1% |
| T | 2 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 408474 | |
| 1 | 243902 | |
| 3 | 189432 | |
| 2 | 188925 | |
| 5 | 186067 | |
| 7 | 185084 | |
| 8 | 180278 | |
| 4 | 174743 | |
| 6 | 131878 | 6.6% |
| 9 | 106424 | 5.3% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 171255 | |
| – | 2 | < 0.1% |
Other Symbol
| Value | Count | Frequency (%) |
| ° | 5008 | |
| ◦ | 1 | < 0.1% |
Modifier Symbol
| Value | Count | Frequency (%) |
| ´ | 35 | |
| ˚ | 12 | 25.5% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 35 | |
| ] | 21 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 33 | |
| [ | 20 |
Space Separator
| Value | Count | Frequency (%) |
| 623777 |
Other Letter
| Value | Count | Frequency (%) |
| º | 232 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 82 |
Final Punctuation
| Value | Count | Frequency (%) |
| ” | 55 |
Modifier Letter
| Value | Count | Frequency (%) |
| ʹ | 25 |
Math Symbol
| Value | Count | Frequency (%) |
| ~ | 3 |
Nonspacing Mark
| Value | Count | Frequency (%) |
| ̊ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2897755 | |
| Latin | 299336 | 9.4% |
| Inherited | 1 | < 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 623777 | ||
| 0 | 408474 | |
| 1 | 243902 | 8.4% |
| 3 | 189432 | 6.5% |
| 2 | 188925 | 6.5% |
| 5 | 186067 | 6.4% |
| 7 | 185084 | 6.4% |
| 8 | 180278 | 6.2% |
| 4 | 174743 | 6.0% |
| - | 171255 | 5.9% |
| Other values (27) | 345818 |
Latin
| Value | Count | Frequency (%) |
| W | 213465 | |
| E | 72880 | 24.3% |
| d | 2999 | 1.0% |
| e | 2981 | 1.0% |
| g | 2925 | 1.0% |
| n | 552 | 0.2% |
| L | 522 | 0.2% |
| o | 520 | 0.2% |
| t | 402 | 0.1% |
| i | 392 | 0.1% |
| Other values (19) | 1698 | 0.6% |
Inherited
| Value | Count | Frequency (%) |
| ̊ | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3191490 | |
| None | 5275 | 0.2% |
| Punctuation | 288 | < 0.1% |
| Modifier Letters | 37 | < 0.1% |
| Diacriticals | 1 | < 0.1% |
| Geometric Shapes | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 623777 | ||
| 0 | 408474 | |
| 1 | 243902 | 7.6% |
| W | 213465 | 6.7% |
| 3 | 189432 | 5.9% |
| 2 | 188925 | 5.9% |
| 5 | 186067 | 5.8% |
| 7 | 185084 | 5.8% |
| 8 | 180278 | 5.6% |
| 4 | 174743 | 5.5% |
| Other values (46) | 597343 |
None
| Value | Count | Frequency (%) |
| ° | 5008 | |
| º | 232 | 4.4% |
| ´ | 35 | 0.7% |
Punctuation
| Value | Count | Frequency (%) |
| ′ | 177 | |
| ” | 55 | 19.1% |
| ″ | 54 | 18.8% |
| – | 2 | 0.7% |
Modifier Letters
| Value | Count | Frequency (%) |
| ʹ | 25 | |
| ˚ | 12 |
Diacriticals
| Value | Count | Frequency (%) |
| ̊ | 1 |
Geometric Shapes
| Value | Count | Frequency (%) |
| ◦ | 1 |
Missing 
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 3396655 |
| Missing (%) | 89.1% |
| Memory size | 29.1 MiB |
Length
| Max length | 23 |
|---|---|
| Median length | 23 |
| Mean length | 22.71909526 |
| Min length | 3 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Degrees Minutes Seconds |
|---|---|
| 2nd row | Degrees Minutes Seconds |
| 3rd row | Degrees Minutes Seconds |
| 4th row | Degrees Minutes Seconds |
| 5th row | Degrees Minutes Seconds |
| Value | Count | Frequency (%) |
| degrees | 413797 | |
| minutes | 403867 | |
| seconds | 403867 | |
| decimal | 9930 | 0.8% |
| township | 2873 | 0.2% |
| range | 2873 | 0.2% |
| utm | 296 | < 0.1% |
| marsden | 232 | < 0.1% |
| square | 232 | < 0.1% |
| unknown | 232 | < 0.1% |
| Other values (6) | 20 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 2062392 | |
| s | 1224636 | |
| 820775 | 8.7% | |
| n | 814408 | 8.6% |
| g | 416670 | 4.4% |
| i | 416670 | 4.4% |
| r | 414261 | 4.4% |
| d | 414016 | 4.4% |
| D | 413822 | 4.4% |
| c | 413797 | 4.4% |
| Other values (26) | 2072503 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7434230 | |
| Uppercase Letter | 1228922 | 13.0% |
| Space Separator | 820775 | 8.7% |
| Decimal Number | 21 | < 0.1% |
| Other Punctuation | 1 | < 0.1% |
| Dash Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 2062392 | |
| s | 1224636 | |
| n | 814408 | 11.0% |
| g | 416670 | 5.6% |
| i | 416670 | 5.6% |
| r | 414261 | 5.6% |
| d | 414016 | 5.6% |
| c | 413797 | 5.6% |
| o | 406972 | 5.5% |
| u | 404099 | 5.4% |
| Other values (9) | 446309 | 6.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| D | 413822 | |
| M | 404395 | |
| S | 404099 | |
| T | 3169 | 0.3% |
| R | 2873 | 0.2% |
| U | 540 | < 0.1% |
| A | 12 | < 0.1% |
| Q | 12 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 15 | |
| 6 | 2 | 9.5% |
| 2 | 1 | 4.8% |
| 1 | 1 | 4.8% |
| 8 | 1 | 4.8% |
| 7 | 1 | 4.8% |
Space Separator
| Value | Count | Frequency (%) |
| 820775 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8663152 | |
| Common | 820798 | 8.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 2062392 | |
| s | 1224636 | |
| n | 814408 | 9.4% |
| g | 416670 | 4.8% |
| i | 416670 | 4.8% |
| r | 414261 | 4.8% |
| d | 414016 | 4.8% |
| D | 413822 | 4.8% |
| c | 413797 | 4.8% |
| o | 406972 | 4.7% |
| Other values (17) | 1665508 |
Common
| Value | Count | Frequency (%) |
| 820775 | ||
| 0 | 15 | < 0.1% |
| 6 | 2 | < 0.1% |
| 2 | 1 | < 0.1% |
| . | 1 | < 0.1% |
| 1 | 1 | < 0.1% |
| 8 | 1 | < 0.1% |
| 7 | 1 | < 0.1% |
| - | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9483950 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 2062392 | |
| s | 1224636 | |
| 820775 | 8.7% | |
| n | 814408 | 8.6% |
| g | 416670 | 4.4% |
| i | 416670 | 4.4% |
| r | 414261 | 4.4% |
| d | 414016 | 4.4% |
| D | 413822 | 4.4% |
| c | 413797 | 4.4% |
| Other values (26) | 2072503 |
verbatimSRS
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 3814097 |
| Missing (%) | > 99.9% |
| Memory size | 29.1 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 8 |
| Mean length | 8 |
| Min length | 6 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 2700.0 |
|---|---|
| 2nd row | 1889-03-29 |
| Value | Count | Frequency (%) |
| 2700.0 | 1 | |
| 1889-03-29 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 4 | |
| 2 | 2 | |
| 8 | 2 | |
| 9 | 2 | |
| - | 2 | |
| 7 | 1 | 6.2% |
| . | 1 | 6.2% |
| 1 | 1 | 6.2% |
| 3 | 1 | 6.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 13 | |
| Dash Punctuation | 2 | 12.5% |
| Other Punctuation | 1 | 6.2% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 4 | |
| 2 | 2 | |
| 8 | 2 | |
| 9 | 2 | |
| 7 | 1 | 7.7% |
| 1 | 1 | 7.7% |
| 3 | 1 | 7.7% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 16 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 4 | |
| 2 | 2 | |
| 8 | 2 | |
| 9 | 2 | |
| - | 2 | |
| 7 | 1 | 6.2% |
| . | 1 | 6.2% |
| 1 | 1 | 6.2% |
| 3 | 1 | 6.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 16 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 4 | |
| 2 | 2 | |
| 8 | 2 | |
| 9 | 2 | |
| - | 2 | |
| 7 | 1 | 6.2% |
| . | 1 | 6.2% |
| 1 | 1 | 6.2% |
| 3 | 1 | 6.2% |
footprintSRS
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 3814097 |
| Missing (%) | > 99.9% |
| Memory size | 29.1 MiB |
Length
| Max length | 43 |
|---|---|
| Median length | 22.5 |
| Mean length | 22.5 |
| Min length | 2 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 88 |
|---|---|
| 2nd row | Animalia, Mollusca, Gastropoda, Hydrobiidae |
| Value | Count | Frequency (%) |
| 88 | 1 | |
| animalia | 1 | |
| mollusca | 1 | |
| gastropoda | 1 | |
| hydrobiidae | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 6 | |
| i | 4 | 8.9% |
| o | 4 | 8.9% |
| l | 3 | 6.7% |
| , | 3 | 6.7% |
| 3 | 6.7% | |
| d | 3 | 6.7% |
| 8 | 2 | 4.4% |
| s | 2 | 4.4% |
| r | 2 | 4.4% |
| Other values (13) | 13 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 33 | |
| Uppercase Letter | 4 | 8.9% |
| Other Punctuation | 3 | 6.7% |
| Space Separator | 3 | 6.7% |
| Decimal Number | 2 | 4.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 6 | |
| i | 4 | |
| o | 4 | |
| l | 3 | |
| d | 3 | |
| s | 2 | 6.1% |
| r | 2 | 6.1% |
| t | 1 | 3.0% |
| b | 1 | 3.0% |
| y | 1 | 3.0% |
| Other values (6) | 6 |
Uppercase Letter
| Value | Count | Frequency (%) |
| H | 1 | |
| G | 1 | |
| A | 1 | |
| M | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 3 |
Space Separator
| Value | Count | Frequency (%) |
| 3 |
Decimal Number
| Value | Count | Frequency (%) |
| 8 | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 37 | |
| Common | 8 | 17.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 6 | |
| i | 4 | 10.8% |
| o | 4 | 10.8% |
| l | 3 | 8.1% |
| d | 3 | 8.1% |
| s | 2 | 5.4% |
| r | 2 | 5.4% |
| t | 1 | 2.7% |
| b | 1 | 2.7% |
| y | 1 | 2.7% |
| Other values (10) | 10 |
Common
| Value | Count | Frequency (%) |
| , | 3 | |
| 3 | ||
| 8 | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 45 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 6 | |
| i | 4 | 8.9% |
| o | 4 | 8.9% |
| l | 3 | 6.7% |
| , | 3 | 6.7% |
| 3 | 6.7% | |
| d | 3 | 6.7% |
| 8 | 2 | 4.4% |
| s | 2 | 4.4% |
| r | 2 | 4.4% |
| Other values (13) | 13 |
Missing 
| Distinct | 8 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 3814091 |
| Missing (%) | > 99.9% |
| Memory size | 29.1 MiB |
Length
| Max length | 17 |
|---|---|
| Median length | 16 |
| Mean length | 13 |
| Min length | 2 |
Unique
| Unique | 8 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Drosera sp. |
|---|---|
| 2nd row | 88 |
| 3rd row | Miconia coronata |
| 4th row | Boerhavia diffusa |
| 5th row | Animalia |
| Value | Count | Frequency (%) |
| drosera | 1 | 7.1% |
| sp | 1 | 7.1% |
| 88 | 1 | 7.1% |
| miconia | 1 | 7.1% |
| coronata | 1 | 7.1% |
| boerhavia | 1 | 7.1% |
| diffusa | 1 | 7.1% |
| animalia | 1 | 7.1% |
| myrcia | 1 | 7.1% |
| splendens | 1 | 7.1% |
| Other values (4) | 4 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 14 | |
| i | 10 | 9.6% |
| s | 10 | 9.6% |
| r | 8 | 7.7% |
| n | 7 | 6.7% |
| e | 6 | 5.8% |
| 6 | 5.8% | |
| o | 5 | 4.8% |
| t | 4 | 3.8% |
| c | 4 | 3.8% |
| Other values (17) | 30 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 88 | |
| Uppercase Letter | 7 | 6.7% |
| Space Separator | 6 | 5.8% |
| Decimal Number | 2 | 1.9% |
| Other Punctuation | 1 | 1.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 14 | |
| i | 10 | |
| s | 10 | |
| r | 8 | |
| n | 7 | |
| e | 6 | 6.8% |
| o | 5 | 5.7% |
| t | 4 | 4.5% |
| c | 4 | 4.5% |
| u | 4 | 4.5% |
| Other values (9) | 16 |
Uppercase Letter
| Value | Count | Frequency (%) |
| B | 2 | |
| M | 2 | |
| D | 1 | |
| A | 1 | |
| C | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 6 |
Decimal Number
| Value | Count | Frequency (%) |
| 8 | 2 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 95 | |
| Common | 9 | 8.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 14 | |
| i | 10 | |
| s | 10 | |
| r | 8 | 8.4% |
| n | 7 | 7.4% |
| e | 6 | 6.3% |
| o | 5 | 5.3% |
| t | 4 | 4.2% |
| c | 4 | 4.2% |
| u | 4 | 4.2% |
| Other values (14) | 23 |
Common
| Value | Count | Frequency (%) |
| 6 | ||
| 8 | 2 | 22.2% |
| . | 1 | 11.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 104 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 14 | |
| i | 10 | 9.6% |
| s | 10 | 9.6% |
| r | 8 | 7.7% |
| n | 7 | 6.7% |
| e | 6 | 5.8% |
| 6 | 5.8% | |
| o | 5 | 4.8% |
| t | 4 | 3.8% |
| c | 4 | 3.8% |
| Other values (17) | 30 |
georeferencedBy
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 3814097 |
| Missing (%) | > 99.9% |
| Memory size | 29.1 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 6 |
| Mean length | 6 |
| Min length | 4 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 1889 |
|---|---|
| 2nd row | Mollusca |
| Value | Count | Frequency (%) |
| 1889 | 1 | |
| mollusca | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 8 | 2 | |
| l | 2 | |
| 1 | 1 | |
| 9 | 1 | |
| M | 1 | |
| o | 1 | |
| u | 1 | |
| s | 1 | |
| c | 1 | |
| a | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7 | |
| Decimal Number | 4 | |
| Uppercase Letter | 1 | 8.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| l | 2 | |
| o | 1 | |
| u | 1 | |
| s | 1 | |
| c | 1 | |
| a | 1 |
Decimal Number
| Value | Count | Frequency (%) |
| 8 | 2 | |
| 1 | 1 | |
| 9 | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8 | |
| Common | 4 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| l | 2 | |
| M | 1 | |
| o | 1 | |
| u | 1 | |
| s | 1 | |
| c | 1 | |
| a | 1 |
Common
| Value | Count | Frequency (%) |
| 8 | 2 | |
| 1 | 1 | |
| 9 | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 12 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 8 | 2 | |
| l | 2 | |
| 1 | 1 | |
| 9 | 1 | |
| M | 1 | |
| o | 1 | |
| u | 1 | |
| s | 1 | |
| c | 1 | |
| a | 1 |
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 3814097 |
| Missing (%) | > 99.9% |
| Memory size | 29.1 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 5.5 |
| Mean length | 5.5 |
| Min length | 1 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 3 |
|---|---|
| 2nd row | Gastropoda |
| Value | Count | Frequency (%) |
| 3 | 1 | |
| gastropoda | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 2 | |
| o | 2 | |
| 3 | 1 | |
| G | 1 | |
| s | 1 | |
| t | 1 | |
| r | 1 | |
| p | 1 | |
| d | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 9 | |
| Decimal Number | 1 | 9.1% |
| Uppercase Letter | 1 | 9.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 2 | |
| o | 2 | |
| s | 1 | |
| t | 1 | |
| r | 1 | |
| p | 1 | |
| d | 1 |
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| G | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 10 | |
| Common | 1 | 9.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 2 | |
| o | 2 | |
| G | 1 | |
| s | 1 | |
| t | 1 | |
| r | 1 | |
| p | 1 | |
| d | 1 |
Common
| Value | Count | Frequency (%) |
| 3 | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 11 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 2 | |
| o | 2 | |
| 3 | 1 | |
| G | 1 | |
| s | 1 | |
| t | 1 | |
| r | 1 | |
| p | 1 | |
| d | 1 |
Missing 
| Distinct | 2782 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 3320409 |
| Missing (%) | 87.1% |
| Memory size | 29.1 MiB |
Length
| Max length | 302 |
|---|---|
| Median length | 300 |
| Mean length | 25.53193705 |
| Min length | 2 |
Unique
| Unique | 851 ? |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | unknown, from legacy |
|---|---|
| 2nd row | GEOLocate |
| 3rd row | ArcGIS software with data from New Mexico Resource Geographic Information System Program (http://rgis.unm.edu) and other inhouse resources (historical maps aiding with name changes), MaNIS/HerpNET/ORNIS Georeferencing Guidelines |
| 4th row | Google Earth |
| 5th row | Alexandria Digital Library Gazetteer, MaNIS/HerpNET/ORNIS Georeferencing Guidelines |
| Value | Count | Frequency (%) |
| from | 211518 | 13.0% |
| unknown | 208911 | 12.8% |
| legacy | 208034 | 12.7% |
| 88871 | 5.4% | |
| earth | 64975 | 4.0% |
| geolocate | 58622 | 3.6% |
| georeferencing | 56351 | 3.5% |
| manis/herpnet/ornis | 55304 | 3.4% |
| guidelines | 55299 | 3.4% |
| gazetteer | 32635 | 2.0% |
| Other values (3214) | 592554 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1139384 | 9.0% | |
| e | 1081446 | 8.6% |
| o | 916290 | 7.3% |
| n | 905258 | 7.2% |
| a | 730763 | 5.8% |
| r | 662416 | 5.3% |
| l | 440906 | 3.5% |
| g | 426015 | 3.4% |
| G | 400771 | 3.2% |
| c | 398496 | 3.2% |
| Other values (72) | 5503117 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 8535644 | |
| Uppercase Letter | 1879803 | 14.9% |
| Space Separator | 1139384 | 9.0% |
| Other Punctuation | 544960 | 4.3% |
| Decimal Number | 396316 | 3.1% |
| Open Punctuation | 39838 | 0.3% |
| Close Punctuation | 39754 | 0.3% |
| Dash Punctuation | 28917 | 0.2% |
| Math Symbol | 138 | < 0.1% |
| Connector Punctuation | 108 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1081446 | |
| o | 916290 | 10.7% |
| n | 905258 | 10.6% |
| a | 730763 | 8.6% |
| r | 662416 | 7.8% |
| l | 440906 | 5.2% |
| g | 426015 | 5.0% |
| c | 398496 | 4.7% |
| u | 341516 | 4.0% |
| i | 334659 | 3.9% |
| Other values (17) | 2297879 |
Uppercase Letter
| Value | Count | Frequency (%) |
| G | 400771 | |
| S | 220477 | |
| N | 208397 | |
| E | 184537 | |
| I | 137133 | 7.3% |
| O | 120049 | 6.4% |
| M | 110761 | 5.9% |
| T | 106408 | 5.7% |
| L | 77973 | 4.1% |
| R | 62436 | 3.3% |
| Other values (17) | 250861 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 325185 | |
| / | 122278 | 22.4% |
| : | 43075 | 7.9% |
| . | 40103 | 7.4% |
| ; | 6297 | 1.2% |
| ! | 3660 | 0.7% |
| # | 2675 | 0.5% |
| ' | 1093 | 0.2% |
| & | 548 | 0.1% |
| ? | 40 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 180526 | |
| 2 | 62657 | 15.8% |
| 1 | 56273 | 14.2% |
| 4 | 32987 | 8.3% |
| 5 | 16514 | 4.2% |
| 9 | 11759 | 3.0% |
| 7 | 10473 | 2.6% |
| 6 | 10339 | 2.6% |
| 3 | 9170 | 2.3% |
| 8 | 5618 | 1.4% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 136 | |
| = | 2 | 1.4% |
Space Separator
| Value | Count | Frequency (%) |
| 1139384 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 39838 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 39754 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 28917 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 108 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 10415447 | |
| Common | 2189415 | 17.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1081446 | 10.4% |
| o | 916290 | 8.8% |
| n | 905258 | 8.7% |
| a | 730763 | 7.0% |
| r | 662416 | 6.4% |
| l | 440906 | 4.2% |
| g | 426015 | 4.1% |
| G | 400771 | 3.8% |
| c | 398496 | 3.8% |
| u | 341516 | 3.3% |
| Other values (44) | 4111570 |
Common
| Value | Count | Frequency (%) |
| 1139384 | ||
| , | 325185 | 14.9% |
| 0 | 180526 | 8.2% |
| / | 122278 | 5.6% |
| 2 | 62657 | 2.9% |
| 1 | 56273 | 2.6% |
| : | 43075 | 2.0% |
| . | 40103 | 1.8% |
| ( | 39838 | 1.8% |
| ) | 39754 | 1.8% |
| Other values (18) | 140342 | 6.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 12603228 | |
| None | 1634 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1139384 | 9.0% | |
| e | 1081446 | 8.6% |
| o | 916290 | 7.3% |
| n | 905258 | 7.2% |
| a | 730763 | 5.8% |
| r | 662416 | 5.3% |
| l | 440906 | 3.5% |
| g | 426015 | 3.4% |
| G | 400771 | 3.2% |
| c | 398496 | 3.2% |
| Other values (70) | 5501483 |
None
| Value | Count | Frequency (%) |
| í | 1633 | |
| Î | 1 | 0.1% |
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 3814098 |
| Missing (%) | > 99.9% |
| Memory size | 29.1 MiB |
Length
| Max length | 11 |
|---|---|
| Median length | 11 |
| Mean length | 11 |
| Min length | 11 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 29 Mar 1889 |
|---|
| Value | Count | Frequency (%) |
| 29 | 1 | |
| mar | 1 | |
| 1889 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 9 | 2 | |
| 2 | ||
| 8 | 2 | |
| 2 | 1 | |
| M | 1 | |
| a | 1 | |
| r | 1 | |
| 1 | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 6 | |
| Space Separator | 2 | 18.2% |
| Lowercase Letter | 2 | 18.2% |
| Uppercase Letter | 1 | 9.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 9 | 2 | |
| 8 | 2 | |
| 2 | 1 | |
| 1 | 1 |
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1 | |
| r | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 2 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 8 | |
| Latin | 3 | 27.3% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 9 | 2 | |
| 2 | ||
| 8 | 2 | |
| 2 | 1 | |
| 1 | 1 |
Latin
| Value | Count | Frequency (%) |
| M | 1 | |
| a | 1 | |
| r | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 11 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 9 | 2 | |
| 2 | ||
| 8 | 2 | |
| 2 | 1 | |
| M | 1 | |
| a | 1 | |
| r | 1 | |
| 1 | 1 |
Missing 
| Distinct | 6364 |
|---|---|
| Distinct (%) | 7.6% |
| Missing | 3730205 |
| Missing (%) | 97.8% |
| Memory size | 29.1 MiB |
Length
| Max length | 182 |
|---|---|
| Median length | 126 |
| Mean length | 21.79519394 |
| Min length | 1 |
Unique
| Unique | 3216 ? |
|---|---|
| Unique (%) | 3.8% |
Sample
| 1st row | Locality extent = 400 m |
|---|---|
| 2nd row | Locality extent = 0.6 |
| 3rd row | Locality extent = 1.059 mi. |
| 4th row | Locality extent = 800 m |
| 5th row | Coordinate Uncertainty In Meters: 44967 |
| Value | Count | Frequency (%) |
| locality | 55561 | |
| 55391 | ||
| extent | 55332 | |
| mi | 16479 | 4.9% |
| ca | 7757 | 2.3% |
| km | 4763 | 1.4% |
| approximate | 4046 | 1.2% |
| in | 3776 | 1.1% |
| coordinate | 3445 | 1.0% |
| meters | 3433 | 1.0% |
| Other values (6286) | 124350 |
Most occurring characters
| Value | Count | Frequency (%) |
| 250439 | 13.7% | |
| t | 206872 | 11.3% |
| e | 156329 | 8.5% |
| a | 99262 | 5.4% |
| i | 95332 | 5.2% |
| o | 87887 | 4.8% |
| n | 86284 | 4.7% |
| l | 68326 | 3.7% |
| c | 64373 | 3.5% |
| . | 63081 | 3.4% |
| Other values (73) | 650301 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1145629 | |
| Space Separator | 250439 | 13.7% |
| Decimal Number | 175491 | 9.6% |
| Uppercase Letter | 127324 | 7.0% |
| Other Punctuation | 72826 | 4.0% |
| Math Symbol | 55339 | 3.0% |
| Dash Punctuation | 866 | < 0.1% |
| Open Punctuation | 285 | < 0.1% |
| Close Punctuation | 285 | < 0.1% |
| Initial Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 206872 | |
| e | 156329 | |
| a | 99262 | |
| i | 95332 | |
| o | 87887 | |
| n | 86284 | |
| l | 68326 | 6.0% |
| c | 64373 | 5.6% |
| y | 61729 | 5.4% |
| x | 60958 | 5.3% |
| Other values (17) | 158277 |
Uppercase Letter
| Value | Count | Frequency (%) |
| L | 55887 | |
| C | 14147 | 11.1% |
| A | 8746 | 6.9% |
| M | 4777 | 3.8% |
| I | 4295 | 3.4% |
| G | 4023 | 3.2% |
| P | 3673 | 2.9% |
| U | 3576 | 2.8% |
| D | 3420 | 2.7% |
| S | 3225 | 2.5% |
| Other values (16) | 21555 | 16.9% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 33298 | |
| 1 | 29145 | |
| 5 | 23499 | |
| 2 | 21435 | |
| 3 | 16974 | |
| 6 | 13385 | |
| 4 | 11027 | 6.3% |
| 7 | 10595 | 6.0% |
| 8 | 9404 | 5.4% |
| 9 | 6729 | 3.8% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 63081 | |
| : | 3636 | 5.0% |
| , | 2559 | 3.5% |
| ; | 2554 | 3.5% |
| / | 805 | 1.1% |
| ' | 154 | 0.2% |
| " | 20 | < 0.1% |
| & | 9 | < 0.1% |
| # | 8 | < 0.1% |
Math Symbol
| Value | Count | Frequency (%) |
| = | 55316 | |
| + | 22 | < 0.1% |
| ± | 1 | < 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 276 | |
| [ | 9 | 3.2% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 276 | |
| ] | 9 | 3.2% |
Space Separator
| Value | Count | Frequency (%) |
| 250439 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 866 |
Initial Punctuation
| Value | Count | Frequency (%) |
| “ | 1 |
Final Punctuation
| Value | Count | Frequency (%) |
| ” | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1272953 | |
| Common | 555533 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 206872 | |
| e | 156329 | |
| a | 99262 | 7.8% |
| i | 95332 | 7.5% |
| o | 87887 | 6.9% |
| n | 86284 | 6.8% |
| l | 68326 | 5.4% |
| c | 64373 | 5.1% |
| y | 61729 | 4.8% |
| x | 60958 | 4.8% |
| Other values (43) | 285601 |
Common
| Value | Count | Frequency (%) |
| 250439 | ||
| . | 63081 | 11.4% |
| = | 55316 | 10.0% |
| 0 | 33298 | 6.0% |
| 1 | 29145 | 5.2% |
| 5 | 23499 | 4.2% |
| 2 | 21435 | 3.9% |
| 3 | 16974 | 3.1% |
| 6 | 13385 | 2.4% |
| 4 | 11027 | 2.0% |
| Other values (20) | 37934 | 6.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1828481 | |
| None | 3 | < 0.1% |
| Punctuation | 2 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 250439 | 13.7% | |
| t | 206872 | 11.3% |
| e | 156329 | 8.5% |
| a | 99262 | 5.4% |
| i | 95332 | 5.2% |
| o | 87887 | 4.8% |
| n | 86284 | 4.7% |
| l | 68326 | 3.7% |
| c | 64373 | 3.5% |
| . | 63081 | 3.4% |
| Other values (69) | 650296 |
None
| Value | Count | Frequency (%) |
| ñ | 2 | |
| ± | 1 |
Punctuation
| Value | Count | Frequency (%) |
| “ | 1 | |
| ” | 1 |
Missing 
| Distinct | 7 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 3814092 |
| Missing (%) | > 99.9% |
| Memory size | 29.1 MiB |
Length
| Max length | 69 |
|---|---|
| Median length | 43 |
| Mean length | 45.28571429 |
| Min length | 28 |
Unique
| Unique | 7 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | North America, United States, California |
|---|---|
| 2nd row | North America, United States, Oklahoma, Pontotoc County |
| 3rd row | North America, United States, Alaska |
| 4th row | North America, United States, Massachusetts |
| 5th row | North America, United States, Arizona, Cochise |
| Value | Count | Frequency (%) |
| north | 7 | |
| united | 7 | |
| states | 7 | |
| america | 6 | |
| county | 2 | 5.0% |
| massachusetts | 2 | 5.0% |
| california | 1 | 2.5% |
| oklahoma | 1 | 2.5% |
| pontotoc | 1 | 2.5% |
| alaska | 1 | 2.5% |
| Other values (5) | 5 |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 39 | |
| 33 | 10.4% | |
| a | 28 | 8.8% |
| e | 25 | 7.9% |
| s | 18 | 5.7% |
| i | 18 | 5.7% |
| o | 16 | 5.0% |
| r | 16 | 5.0% |
| , | 16 | 5.0% |
| n | 15 | 4.7% |
| Other values (20) | 93 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 228 | |
| Uppercase Letter | 40 | 12.6% |
| Space Separator | 33 | 10.4% |
| Other Punctuation | 16 | 5.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 39 | |
| a | 28 | |
| e | 25 | |
| s | 18 | |
| i | 18 | |
| o | 16 | |
| r | 16 | |
| n | 15 | 6.6% |
| c | 12 | 5.3% |
| h | 11 | 4.8% |
| Other values (9) | 30 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 9 | |
| N | 7 | |
| S | 7 | |
| U | 7 | |
| C | 4 | |
| O | 2 | 5.0% |
| M | 2 | 5.0% |
| P | 1 | 2.5% |
| B | 1 | 2.5% |
Space Separator
| Value | Count | Frequency (%) |
| 33 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 16 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 268 | |
| Common | 49 | 15.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 39 | |
| a | 28 | 10.4% |
| e | 25 | 9.3% |
| s | 18 | 6.7% |
| i | 18 | 6.7% |
| o | 16 | 6.0% |
| r | 16 | 6.0% |
| n | 15 | 5.6% |
| c | 12 | 4.5% |
| h | 11 | 4.1% |
| Other values (18) | 70 |
Common
| Value | Count | Frequency (%) |
| 33 | ||
| , | 16 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 317 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 39 | |
| 33 | 10.4% | |
| a | 28 | 8.8% |
| e | 25 | 7.9% |
| s | 18 | 5.7% |
| i | 18 | 5.7% |
| o | 16 | 5.0% |
| r | 16 | 5.0% |
| , | 16 | 5.0% |
| n | 15 | 4.7% |
| Other values (20) | 93 |
earliestEonOrLowestEonothem
Text
Missing 
| Distinct | 8 |
|---|---|
| Distinct (%) | 61.5% |
| Missing | 3814086 |
| Missing (%) | > 99.9% |
| Memory size | 29.1 MiB |
Length
| Max length | 67 |
|---|---|
| Median length | 55 |
| Mean length | 32.61538462 |
| Min length | 13 |
Unique
| Unique | 7 ? |
|---|---|
| Unique (%) | 53.8% |
Sample
| 1st row | North America |
|---|---|
| 2nd row | Plantae, Dicotyledonae, Caryophyllales, Droseraceae |
| 3rd row | North America |
| 4th row | North America |
| 5th row | North America |
| Value | Count | Frequency (%) |
| north | 7 | |
| america | 6 | |
| plantae | 5 | |
| dicotyledonae | 5 | |
| caryophyllales | 2 | 4.8% |
| myrtales | 2 | 4.8% |
| myrtoideae | 1 | 2.4% |
| fagales | 1 | 2.4% |
| buprestidae | 1 | 2.4% |
| coleoptera | 1 | 2.4% |
| Other values (11) | 11 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 56 | |
| e | 49 | |
| t | 32 | 7.5% |
| 29 | 6.8% | |
| o | 28 | 6.6% |
| r | 26 | 6.1% |
| l | 24 | 5.7% |
| , | 21 | 5.0% |
| c | 20 | 4.7% |
| i | 19 | 4.5% |
| Other values (19) | 120 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 332 | |
| Uppercase Letter | 42 | 9.9% |
| Space Separator | 29 | 6.8% |
| Other Punctuation | 21 | 5.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 56 | |
| e | 49 | |
| t | 32 | |
| o | 28 | |
| r | 26 | |
| l | 24 | |
| c | 20 | 6.0% |
| i | 19 | 5.7% |
| n | 16 | 4.8% |
| y | 14 | 4.2% |
| Other values (7) | 48 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 9 | |
| N | 8 | |
| D | 6 | |
| M | 6 | |
| P | 5 | |
| C | 4 | |
| O | 1 | 2.4% |
| I | 1 | 2.4% |
| B | 1 | 2.4% |
| F | 1 | 2.4% |
Space Separator
| Value | Count | Frequency (%) |
| 29 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 21 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 374 | |
| Common | 50 | 11.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 56 | |
| e | 49 | |
| t | 32 | 8.6% |
| o | 28 | 7.5% |
| r | 26 | 7.0% |
| l | 24 | 6.4% |
| c | 20 | 5.3% |
| i | 19 | 5.1% |
| n | 16 | 4.3% |
| y | 14 | 3.7% |
| Other values (17) | 90 |
Common
| Value | Count | Frequency (%) |
| 29 | ||
| , | 21 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 424 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 56 | |
| e | 49 | |
| t | 32 | 7.5% |
| 29 | 6.8% | |
| o | 28 | 6.6% |
| r | 26 | 6.1% |
| l | 24 | 5.7% |
| , | 21 | 5.0% |
| c | 20 | 4.7% |
| i | 19 | 4.5% |
| Other values (19) | 120 |
latestEonOrHighestEonothem
Text
Missing 
| Distinct | 4 |
|---|---|
| Distinct (%) | 50.0% |
| Missing | 3814091 |
| Missing (%) | > 99.9% |
| Memory size | 29.1 MiB |
Length
| Max length | 20 |
|---|---|
| Median length | 7 |
| Mean length | 9.375 |
| Min length | 7 |
Unique
| Unique | 3 ? |
|---|---|
| Unique (%) | 37.5% |
Sample
| 1st row | Plantae |
|---|---|
| 2nd row | Earle, S. A. |
| 3rd row | Plantae |
| 4th row | North Atlantic Ocean |
| 5th row | Plantae |
| Value | Count | Frequency (%) |
| plantae | 5 | |
| earle | 1 | 8.3% |
| s | 1 | 8.3% |
| a | 1 | 8.3% |
| north | 1 | 8.3% |
| atlantic | 1 | 8.3% |
| ocean | 1 | 8.3% |
| animalia | 1 | 8.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 15 | |
| n | 8 | |
| t | 8 | |
| l | 8 | |
| e | 7 | |
| P | 5 | 6.7% |
| 4 | 5.3% | |
| A | 3 | 4.0% |
| i | 3 | 4.0% |
| r | 2 | 2.7% |
| Other values (10) | 12 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 56 | |
| Uppercase Letter | 12 | 16.0% |
| Space Separator | 4 | 5.3% |
| Other Punctuation | 3 | 4.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 15 | |
| n | 8 | |
| t | 8 | |
| l | 8 | |
| e | 7 | |
| i | 3 | 5.4% |
| r | 2 | 3.6% |
| c | 2 | 3.6% |
| h | 1 | 1.8% |
| o | 1 | 1.8% |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 5 | |
| A | 3 | |
| O | 1 | 8.3% |
| S | 1 | 8.3% |
| N | 1 | 8.3% |
| E | 1 | 8.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 2 | |
| , | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 4 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 68 | |
| Common | 7 | 9.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 15 | |
| n | 8 | |
| t | 8 | |
| l | 8 | |
| e | 7 | |
| P | 5 | 7.4% |
| A | 3 | 4.4% |
| i | 3 | 4.4% |
| r | 2 | 2.9% |
| c | 2 | 2.9% |
| Other values (7) | 7 |
Common
| Value | Count | Frequency (%) |
| 4 | ||
| . | 2 | |
| , | 1 | 14.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 75 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 15 | |
| n | 8 | |
| t | 8 | |
| l | 8 | |
| e | 7 | |
| P | 5 | 6.7% |
| 4 | 5.3% | |
| A | 3 | 4.0% |
| i | 3 | 4.0% |
| r | 2 | 2.7% |
| Other values (10) | 12 |
earliestEraOrLowestErathem
Text
Missing 
| Distinct | 3 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 3814096 |
| Missing (%) | > 99.9% |
| Memory size | 29.1 MiB |
Length
| Max length | 11 |
|---|---|
| Median length | 10 |
| Mean length | 10.33333333 |
| Min length | 10 |
Unique
| Unique | 3 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 1935-06-26 |
|---|---|
| 2nd row | Fluminicola |
| 3rd row | Arthropoda |
| Value | Count | Frequency (%) |
| 1935-06-26 | 1 | |
| fluminicola | 1 | |
| arthropoda | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 3 | 9.7% |
| - | 2 | 6.5% |
| 6 | 2 | 6.5% |
| r | 2 | 6.5% |
| l | 2 | 6.5% |
| a | 2 | 6.5% |
| i | 2 | 6.5% |
| 1 | 1 | 3.2% |
| c | 1 | 3.2% |
| p | 1 | 3.2% |
| Other values (13) | 13 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 19 | |
| Decimal Number | 8 | |
| Dash Punctuation | 2 | 6.5% |
| Uppercase Letter | 2 | 6.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 3 | |
| r | 2 | |
| l | 2 | |
| a | 2 | |
| i | 2 | |
| c | 1 | 5.3% |
| p | 1 | 5.3% |
| h | 1 | 5.3% |
| t | 1 | 5.3% |
| m | 1 | 5.3% |
| Other values (3) | 3 |
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 2 | |
| 1 | 1 | |
| 9 | 1 | |
| 2 | 1 | |
| 0 | 1 | |
| 5 | 1 | |
| 3 | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 1 | |
| F | 1 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 21 | |
| Common | 10 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 3 | |
| r | 2 | 9.5% |
| l | 2 | 9.5% |
| a | 2 | 9.5% |
| i | 2 | 9.5% |
| c | 1 | 4.8% |
| p | 1 | 4.8% |
| h | 1 | 4.8% |
| t | 1 | 4.8% |
| A | 1 | 4.8% |
| Other values (5) | 5 |
Common
| Value | Count | Frequency (%) |
| - | 2 | |
| 6 | 2 | |
| 1 | 1 | |
| 9 | 1 | |
| 2 | 1 | |
| 0 | 1 | |
| 5 | 1 | |
| 3 | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 31 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 3 | 9.7% |
| - | 2 | 6.5% |
| 6 | 2 | 6.5% |
| r | 2 | 6.5% |
| l | 2 | 6.5% |
| a | 2 | 6.5% |
| i | 2 | 6.5% |
| 1 | 1 | 3.2% |
| c | 1 | 3.2% |
| p | 1 | 3.2% |
| Other values (13) | 13 |
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 33.3% |
| Missing | 3814093 |
| Missing (%) | > 99.9% |
| Memory size | 29.1 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 13 |
| Mean length | 12 |
| Min length | 7 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 16.7% |
Sample
| 1st row | Dicotyledonae |
|---|---|
| 2nd row | Dicotyledonae |
| 3rd row | Dicotyledonae |
| 4th row | Dicotyledonae |
| 5th row | Insecta |
| Value | Count | Frequency (%) |
| dicotyledonae | 5 | |
| insecta | 1 | 16.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 11 | |
| o | 10 | |
| c | 6 | |
| t | 6 | |
| n | 6 | |
| a | 6 | |
| D | 5 | |
| i | 5 | |
| y | 5 | |
| l | 5 | |
| Other values (3) | 7 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 66 | |
| Uppercase Letter | 6 | 8.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 11 | |
| o | 10 | |
| c | 6 | |
| t | 6 | |
| n | 6 | |
| a | 6 | |
| i | 5 | |
| y | 5 | |
| l | 5 | |
| d | 5 |
Uppercase Letter
| Value | Count | Frequency (%) |
| D | 5 | |
| I | 1 | 16.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 72 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 11 | |
| o | 10 | |
| c | 6 | |
| t | 6 | |
| n | 6 | |
| a | 6 | |
| D | 5 | |
| i | 5 | |
| y | 5 | |
| l | 5 | |
| Other values (3) | 7 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 72 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 11 | |
| o | 10 | |
| c | 6 | |
| t | 6 | |
| n | 6 | |
| a | 6 | |
| D | 5 | |
| i | 5 | |
| y | 5 | |
| l | 5 | |
| Other values (3) | 7 |
earliestPeriodOrLowestSystem
Text
Missing 
| Distinct | 6 |
|---|---|
| Distinct (%) | 42.9% |
| Missing | 3814085 |
| Missing (%) | > 99.9% |
| Memory size | 29.1 MiB |
Length
| Max length | 14 |
|---|---|
| Median length | 13.5 |
| Mean length | 11.07142857 |
| Min length | 3 |
Unique
| Unique | 3 ? |
|---|---|
| Unique (%) | 21.4% |
Sample
| 1st row | United States |
|---|---|
| 2nd row | Caryophyllales |
| 3rd row | United States |
| 4th row | United States |
| 5th row | United States |
| Value | Count | Frequency (%) |
| united | 7 | |
| states | 7 | |
| caryophyllales | 2 | 9.5% |
| myrtales | 2 | 9.5% |
| 177 | 1 | 4.8% |
| coleoptera | 1 | 4.8% |
| fagales | 1 | 4.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 24 | |
| e | 21 | |
| a | 16 | |
| s | 12 | 7.7% |
| l | 10 | 6.5% |
| U | 7 | 4.5% |
| i | 7 | 4.5% |
| d | 7 | 4.5% |
| 7 | 4.5% | |
| S | 7 | 4.5% |
| Other values (12) | 37 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 125 | |
| Uppercase Letter | 20 | 12.9% |
| Space Separator | 7 | 4.5% |
| Decimal Number | 3 | 1.9% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 24 | |
| e | 21 | |
| a | 16 | |
| s | 12 | |
| l | 10 | |
| i | 7 | 5.6% |
| d | 7 | 5.6% |
| n | 7 | 5.6% |
| y | 6 | 4.8% |
| r | 5 | 4.0% |
| Other values (4) | 10 |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 7 | |
| S | 7 | |
| C | 3 | |
| M | 2 | 10.0% |
| F | 1 | 5.0% |
Decimal Number
| Value | Count | Frequency (%) |
| 7 | 2 | |
| 1 | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 7 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 145 | |
| Common | 10 | 6.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 24 | |
| e | 21 | |
| a | 16 | |
| s | 12 | |
| l | 10 | 6.9% |
| U | 7 | 4.8% |
| i | 7 | 4.8% |
| d | 7 | 4.8% |
| S | 7 | 4.8% |
| n | 7 | 4.8% |
| Other values (9) | 27 |
Common
| Value | Count | Frequency (%) |
| 7 | ||
| 7 | 2 | 20.0% |
| 1 | 1 | 10.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 155 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 24 | |
| e | 21 | |
| a | 16 | |
| s | 12 | 7.7% |
| l | 10 | 6.5% |
| U | 7 | 4.5% |
| i | 7 | 4.5% |
| d | 7 | 4.5% |
| 7 | 4.5% | |
| S | 7 | 4.5% |
| Other values (12) | 37 |
latestPeriodOrHighestSystem
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 3814098 |
| Missing (%) | > 99.9% |
| Memory size | 29.1 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 177 |
|---|
| Value | Count | Frequency (%) |
| 177 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 7 | 2 | |
| 1 | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 7 | 2 | |
| 1 | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 7 | 2 | |
| 1 | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 7 | 2 | |
| 1 | 1 |
earliestEpochOrLowestSeries
Text
Missing 
| Distinct | 13 |
|---|---|
| Distinct (%) | 92.9% |
| Missing | 3814085 |
| Missing (%) | > 99.9% |
| Memory size | 29.1 MiB |
Length
| Max length | 15 |
|---|---|
| Median length | 10.5 |
| Mean length | 9.714285714 |
| Min length | 3 |
Unique
| Unique | 12 ? |
|---|---|
| Unique (%) | 85.7% |
Sample
| 1st row | California |
|---|---|
| 2nd row | Droseraceae |
| 3rd row | Oklahoma |
| 4th row | Alaska |
| 5th row | Massachusetts |
| Value | Count | Frequency (%) |
| massachusetts | 2 | |
| california | 1 | 7.1% |
| droseraceae | 1 | 7.1% |
| oklahoma | 1 | 7.1% |
| alaska | 1 | 7.1% |
| arizona | 1 | 7.1% |
| melastomataceae | 1 | 7.1% |
| 1935 | 1 | 7.1% |
| nyctaginaceae | 1 | 7.1% |
| sp | 1 | 7.1% |
| Other values (3) | 3 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 27 | |
| e | 16 | |
| s | 14 | 10.3% |
| t | 9 | 6.6% |
| c | 8 | 5.9% |
| r | 7 | 5.1% |
| i | 6 | 4.4% |
| o | 5 | 3.7% |
| n | 4 | 2.9% |
| M | 4 | 2.9% |
| Other values (22) | 36 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 119 | |
| Uppercase Letter | 12 | 8.8% |
| Decimal Number | 4 | 2.9% |
| Other Punctuation | 1 | 0.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 27 | |
| e | 16 | |
| s | 14 | |
| t | 9 | 7.6% |
| c | 8 | 6.7% |
| r | 7 | 5.9% |
| i | 6 | 5.0% |
| o | 5 | 4.2% |
| n | 4 | 3.4% |
| l | 4 | 3.4% |
| Other values (10) | 19 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 4 | |
| C | 2 | |
| A | 2 | |
| B | 1 | 8.3% |
| N | 1 | 8.3% |
| O | 1 | 8.3% |
| D | 1 | 8.3% |
Decimal Number
| Value | Count | Frequency (%) |
| 5 | 1 | |
| 3 | 1 | |
| 9 | 1 | |
| 1 | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 131 | |
| Common | 5 | 3.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 27 | |
| e | 16 | |
| s | 14 | |
| t | 9 | 6.9% |
| c | 8 | 6.1% |
| r | 7 | 5.3% |
| i | 6 | 4.6% |
| o | 5 | 3.8% |
| n | 4 | 3.1% |
| M | 4 | 3.1% |
| Other values (17) | 31 |
Common
| Value | Count | Frequency (%) |
| 5 | 1 | |
| . | 1 | |
| 3 | 1 | |
| 9 | 1 | |
| 1 | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 136 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 27 | |
| e | 16 | |
| s | 14 | 10.3% |
| t | 9 | 6.6% |
| c | 8 | 5.9% |
| r | 7 | 5.1% |
| i | 6 | 4.4% |
| o | 5 | 3.7% |
| n | 4 | 2.9% |
| M | 4 | 2.9% |
| Other values (22) | 36 |
latestEpochOrHighestSeries
Text
Missing 
| Distinct | 5 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 3814094 |
| Missing (%) | > 99.9% |
| Memory size | 29.1 MiB |
Length
| Max length | 58 |
|---|---|
| Median length | 15 |
| Mean length | 19.6 |
| Min length | 1 |
Unique
| Unique | 5 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Pontotoc County |
|---|---|
| 2nd row | North America, Mexico, Baja California Norte, Guadalupe I. |
| 3rd row | Cochise |
| 4th row | Barnstable County |
| 5th row | 6 |
| Value | Count | Frequency (%) |
| county | 2 | |
| pontotoc | 1 | 7.1% |
| north | 1 | 7.1% |
| america | 1 | 7.1% |
| mexico | 1 | 7.1% |
| baja | 1 | 7.1% |
| california | 1 | 7.1% |
| norte | 1 | 7.1% |
| guadalupe | 1 | 7.1% |
| i | 1 | 7.1% |
| Other values (3) | 3 |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 10 | 10.2% |
| a | 9 | 9.2% |
| 9 | 9.2% | |
| t | 7 | 7.1% |
| e | 6 | 6.1% |
| n | 5 | 5.1% |
| r | 5 | 5.1% |
| i | 5 | 5.1% |
| c | 4 | 4.1% |
| C | 4 | 4.1% |
| Other values (22) | 34 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 71 | |
| Uppercase Letter | 13 | 13.3% |
| Space Separator | 9 | 9.2% |
| Other Punctuation | 4 | 4.1% |
| Decimal Number | 1 | 1.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 10 | |
| a | 9 | |
| t | 7 | |
| e | 6 | |
| n | 5 | 7.0% |
| r | 5 | 7.0% |
| i | 5 | 7.0% |
| c | 4 | 5.6% |
| u | 4 | 5.6% |
| l | 3 | 4.2% |
| Other values (10) | 13 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 4 | |
| N | 2 | |
| B | 2 | |
| I | 1 | 7.7% |
| G | 1 | 7.7% |
| P | 1 | 7.7% |
| M | 1 | 7.7% |
| A | 1 | 7.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 3 | |
| . | 1 | 25.0% |
Space Separator
| Value | Count | Frequency (%) |
| 9 |
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 84 | |
| Common | 14 | 14.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 10 | 11.9% |
| a | 9 | 10.7% |
| t | 7 | 8.3% |
| e | 6 | 7.1% |
| n | 5 | 6.0% |
| r | 5 | 6.0% |
| i | 5 | 6.0% |
| c | 4 | 4.8% |
| C | 4 | 4.8% |
| u | 4 | 4.8% |
| Other values (18) | 25 |
Common
| Value | Count | Frequency (%) |
| 9 | ||
| , | 3 | 21.4% |
| . | 1 | 7.1% |
| 6 | 1 | 7.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 98 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 10 | 10.2% |
| a | 9 | 9.2% |
| 9 | 9.2% | |
| t | 7 | 7.1% |
| e | 6 | 6.1% |
| n | 5 | 5.1% |
| r | 5 | 5.1% |
| i | 5 | 5.1% |
| c | 4 | 4.1% |
| C | 4 | 4.1% |
| Other values (22) | 34 |
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 3814097 |
| Missing (%) | > 99.9% |
| Memory size | 29.1 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 7.5 |
| Mean length | 7.5 |
| Min length | 2 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | North America |
|---|---|
| 2nd row | 26 |
| Value | Count | Frequency (%) |
| north | 1 | |
| america | 1 | |
| 26 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| r | 2 | |
| N | 1 | 6.7% |
| o | 1 | 6.7% |
| t | 1 | 6.7% |
| h | 1 | 6.7% |
| 1 | 6.7% | |
| A | 1 | 6.7% |
| m | 1 | 6.7% |
| e | 1 | 6.7% |
| i | 1 | 6.7% |
| Other values (4) | 4 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 10 | |
| Uppercase Letter | 2 | 13.3% |
| Decimal Number | 2 | 13.3% |
| Space Separator | 1 | 6.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 2 | |
| o | 1 | |
| t | 1 | |
| h | 1 | |
| m | 1 | |
| e | 1 | |
| i | 1 | |
| c | 1 | |
| a | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 1 | |
| A | 1 |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 1 | |
| 6 | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 12 | |
| Common | 3 | 20.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 2 | |
| N | 1 | |
| o | 1 | |
| t | 1 | |
| h | 1 | |
| A | 1 | |
| m | 1 | |
| e | 1 | |
| i | 1 | |
| c | 1 |
Common
| Value | Count | Frequency (%) |
| 1 | ||
| 2 | 1 | |
| 6 | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 15 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| r | 2 | |
| N | 1 | 6.7% |
| o | 1 | 6.7% |
| t | 1 | 6.7% |
| h | 1 | 6.7% |
| 1 | 6.7% | |
| A | 1 | 6.7% |
| m | 1 | 6.7% |
| e | 1 | 6.7% |
| i | 1 | 6.7% |
| Other values (4) | 4 |
Missing 
| Distinct | 6 |
|---|---|
| Distinct (%) | 85.7% |
| Missing | 3814092 |
| Missing (%) | > 99.9% |
| Memory size | 29.1 MiB |
Length
| Max length | 49 |
|---|---|
| Median length | 13 |
| Mean length | 14.71428571 |
| Min length | 3 |
Unique
| Unique | 5 ? |
|---|---|
| Unique (%) | 71.4% |
Sample
| 1st row | San Francisco |
|---|---|
| 2nd row | Ada |
| 3rd row | Seldovia |
| 4th row | Scharf, U. |
| 5th row | Woods Hole |
| Value | Count | Frequency (%) |
| woods | 2 | |
| hole | 2 | |
| san | 1 | 6.2% |
| francisco | 1 | 6.2% |
| ada | 1 | 6.2% |
| seldovia | 1 | 6.2% |
| scharf | 1 | 6.2% |
| u | 1 | 6.2% |
| chiricahua | 1 | 6.2% |
| mountains | 1 | 6.2% |
| Other values (4) | 4 |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 13 | 12.6% |
| a | 10 | 9.7% |
| 9 | 8.7% | |
| s | 6 | 5.8% |
| n | 6 | 5.8% |
| r | 5 | 4.9% |
| l | 5 | 4.9% |
| i | 5 | 4.9% |
| d | 4 | 3.9% |
| c | 4 | 3.9% |
| Other values (20) | 36 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 75 | |
| Uppercase Letter | 14 | 13.6% |
| Space Separator | 9 | 8.7% |
| Other Punctuation | 5 | 4.9% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 13 | |
| a | 10 | |
| s | 6 | |
| n | 6 | |
| r | 5 | 6.7% |
| l | 5 | 6.7% |
| i | 5 | 6.7% |
| d | 4 | 5.3% |
| c | 4 | 5.3% |
| h | 3 | 4.0% |
| Other values (7) | 14 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 3 | |
| W | 2 | |
| H | 2 | |
| P | 1 | 7.1% |
| M | 1 | 7.1% |
| B | 1 | 7.1% |
| A | 1 | 7.1% |
| C | 1 | 7.1% |
| U | 1 | 7.1% |
| F | 1 | 7.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 3 | |
| . | 2 |
Space Separator
| Value | Count | Frequency (%) |
| 9 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 89 | |
| Common | 14 | 13.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 13 | |
| a | 10 | 11.2% |
| s | 6 | 6.7% |
| n | 6 | 6.7% |
| r | 5 | 5.6% |
| l | 5 | 5.6% |
| i | 5 | 5.6% |
| d | 4 | 4.5% |
| c | 4 | 4.5% |
| S | 3 | 3.4% |
| Other values (17) | 28 |
Common
| Value | Count | Frequency (%) |
| 9 | ||
| , | 3 | 21.4% |
| . | 2 | 14.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 103 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 13 | 12.6% |
| a | 10 | 9.7% |
| 9 | 8.7% | |
| s | 6 | 5.8% |
| n | 6 | 5.8% |
| r | 5 | 4.9% |
| l | 5 | 4.9% |
| i | 5 | 4.9% |
| d | 4 | 3.9% |
| c | 4 | 3.9% |
| Other values (20) | 36 |
lowestBiostratigraphicZone
Text
Missing 
| Distinct | 6 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 3814093 |
| Missing (%) | > 99.9% |
| Memory size | 29.1 MiB |
Length
| Max length | 9 |
|---|---|
| Median length | 8 |
| Mean length | 7.833333333 |
| Min length | 6 |
Unique
| Unique | 6 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Drosera |
|---|---|
| 2nd row | Miconia |
| 3rd row | Boerhavia |
| 4th row | Myrcia |
| 5th row | Buprestis |
| Value | Count | Frequency (%) |
| drosera | 1 | |
| miconia | 1 | |
| boerhavia | 1 | |
| myrcia | 1 | |
| buprestis | 1 | |
| casuarina | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 8 | |
| i | 6 | |
| r | 6 | |
| s | 4 | |
| o | 3 | 6.4% |
| e | 3 | 6.4% |
| n | 2 | 4.3% |
| u | 2 | 4.3% |
| M | 2 | 4.3% |
| c | 2 | 4.3% |
| Other values (8) | 9 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 41 | |
| Uppercase Letter | 6 | 12.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 8 | |
| i | 6 | |
| r | 6 | |
| s | 4 | |
| o | 3 | 7.3% |
| e | 3 | 7.3% |
| n | 2 | 4.9% |
| u | 2 | 4.9% |
| c | 2 | 4.9% |
| t | 1 | 2.4% |
| Other values (4) | 4 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 2 | |
| B | 2 | |
| D | 1 | |
| C | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 47 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 8 | |
| i | 6 | |
| r | 6 | |
| s | 4 | |
| o | 3 | 6.4% |
| e | 3 | 6.4% |
| n | 2 | 4.3% |
| u | 2 | 4.3% |
| M | 2 | 4.3% |
| c | 2 | 4.3% |
| Other values (8) | 9 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 47 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 8 | |
| i | 6 | |
| r | 6 | |
| s | 4 | |
| o | 3 | 6.4% |
| e | 3 | 6.4% |
| n | 2 | 4.3% |
| u | 2 | 4.3% |
| M | 2 | 4.3% |
| c | 2 | 4.3% |
| Other values (8) | 9 |
highestBiostratigraphicZone
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 3814097 |
| Missing (%) | > 99.9% |
| Memory size | 29.1 MiB |
Length
| Max length | 12 |
|---|---|
| Median length | 9 |
| Mean length | 9 |
| Min length | 6 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Guadalupe I. |
|---|---|
| 2nd row | 2438.0 |
| Value | Count | Frequency (%) |
| guadalupe | 1 | |
| i | 1 | |
| 2438.0 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| u | 2 | 11.1% |
| a | 2 | 11.1% |
| . | 2 | 11.1% |
| G | 1 | 5.6% |
| d | 1 | 5.6% |
| l | 1 | 5.6% |
| p | 1 | 5.6% |
| e | 1 | 5.6% |
| 1 | 5.6% | |
| I | 1 | 5.6% |
| Other values (5) | 5 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 8 | |
| Decimal Number | 5 | |
| Other Punctuation | 2 | 11.1% |
| Uppercase Letter | 2 | 11.1% |
| Space Separator | 1 | 5.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| u | 2 | |
| a | 2 | |
| d | 1 | |
| l | 1 | |
| p | 1 | |
| e | 1 |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 1 | |
| 4 | 1 | |
| 3 | 1 | |
| 8 | 1 | |
| 0 | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| G | 1 | |
| I | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 2 |
Space Separator
| Value | Count | Frequency (%) |
| 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 10 | |
| Common | 8 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| u | 2 | |
| a | 2 | |
| G | 1 | |
| d | 1 | |
| l | 1 | |
| p | 1 | |
| e | 1 | |
| I | 1 |
Common
| Value | Count | Frequency (%) |
| . | 2 | |
| 1 | ||
| 2 | 1 | |
| 4 | 1 | |
| 3 | 1 | |
| 8 | 1 | |
| 0 | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 18 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| u | 2 | 11.1% |
| a | 2 | 11.1% |
| . | 2 | 11.1% |
| G | 1 | 5.6% |
| d | 1 | 5.6% |
| l | 1 | 5.6% |
| p | 1 | 5.6% |
| e | 1 | 5.6% |
| 1 | 5.6% | |
| I | 1 | 5.6% |
| Other values (5) | 5 |
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 3814097 |
| Missing (%) | > 99.9% |
| Memory size | 29.1 MiB |
Length
| Max length | 22 |
|---|---|
| Median length | 14 |
| Mean length | 14 |
| Min length | 6 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Campanula rotundifolia |
|---|---|
| 2nd row | Mexico |
| Value | Count | Frequency (%) |
| campanula | 1 | |
| rotundifolia | 1 | |
| mexico | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 4 | |
| o | 3 | 10.7% |
| i | 3 | 10.7% |
| n | 2 | 7.1% |
| u | 2 | 7.1% |
| l | 2 | 7.1% |
| d | 1 | 3.6% |
| x | 1 | 3.6% |
| e | 1 | 3.6% |
| M | 1 | 3.6% |
| Other values (8) | 8 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 25 | |
| Uppercase Letter | 2 | 7.1% |
| Space Separator | 1 | 3.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 4 | |
| o | 3 | |
| i | 3 | |
| n | 2 | 8.0% |
| u | 2 | 8.0% |
| l | 2 | 8.0% |
| d | 1 | 4.0% |
| x | 1 | 4.0% |
| e | 1 | 4.0% |
| f | 1 | 4.0% |
| Other values (5) | 5 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 1 | |
| C | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 27 | |
| Common | 1 | 3.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 4 | |
| o | 3 | |
| i | 3 | |
| n | 2 | 7.4% |
| u | 2 | 7.4% |
| l | 2 | 7.4% |
| d | 1 | 3.7% |
| x | 1 | 3.7% |
| e | 1 | 3.7% |
| M | 1 | 3.7% |
| Other values (7) | 7 |
Common
| Value | Count | Frequency (%) |
| 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 28 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 4 | |
| o | 3 | 10.7% |
| i | 3 | 10.7% |
| n | 2 | 7.1% |
| u | 2 | 7.1% |
| l | 2 | 7.1% |
| d | 1 | 3.6% |
| x | 1 | 3.6% |
| e | 1 | 3.6% |
| M | 1 | 3.6% |
| Other values (8) | 8 |
formation
Text
Missing 
| Distinct | 7 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 3814092 |
| Missing (%) | > 99.9% |
| Memory size | 29.1 MiB |
Length
| Max length | 21 |
|---|---|
| Median length | 9 |
| Mean length | 8.857142857 |
| Min length | 3 |
Unique
| Unique | 7 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | sp. |
|---|---|
| 2nd row | Baja California Norte |
| 3rd row | coronata |
| 4th row | diffusa |
| 5th row | splendens |
| Value | Count | Frequency (%) |
| sp | 1 | |
| baja | 1 | |
| california | 1 | |
| norte | 1 | |
| coronata | 1 | |
| diffusa | 1 | |
| splendens | 1 | |
| fulgens | 1 | |
| stricta | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 8 | |
| s | 6 | 9.7% |
| n | 5 | 8.1% |
| i | 4 | 6.5% |
| e | 4 | 6.5% |
| t | 4 | 6.5% |
| r | 4 | 6.5% |
| o | 4 | 6.5% |
| f | 4 | 6.5% |
| l | 3 | 4.8% |
| Other values (11) | 16 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 56 | |
| Uppercase Letter | 3 | 4.8% |
| Space Separator | 2 | 3.2% |
| Other Punctuation | 1 | 1.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 8 | |
| s | 6 | |
| n | 5 | |
| i | 4 | 7.1% |
| e | 4 | 7.1% |
| t | 4 | 7.1% |
| r | 4 | 7.1% |
| o | 4 | 7.1% |
| f | 4 | 7.1% |
| l | 3 | 5.4% |
| Other values (6) | 10 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 1 | |
| N | 1 | |
| B | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 2 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 59 | |
| Common | 3 | 4.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 8 | |
| s | 6 | |
| n | 5 | 8.5% |
| i | 4 | 6.8% |
| e | 4 | 6.8% |
| t | 4 | 6.8% |
| r | 4 | 6.8% |
| o | 4 | 6.8% |
| f | 4 | 6.8% |
| l | 3 | 5.1% |
| Other values (9) | 13 |
Common
| Value | Count | Frequency (%) |
| 2 | ||
| . | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 62 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 8 | |
| s | 6 | 9.7% |
| n | 5 | 8.1% |
| i | 4 | 6.5% |
| e | 4 | 6.5% |
| t | 4 | 6.5% |
| r | 4 | 6.5% |
| o | 4 | 6.5% |
| f | 4 | 6.5% |
| l | 3 | 4.8% |
| Other values (11) | 16 |
member
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 3814097 |
| Missing (%) | > 99.9% |
| Memory size | 29.1 MiB |
Length
| Max length | 18 |
|---|---|
| Median length | 15 |
| Mean length | 15 |
| Min length | 12 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Colpomenia sinuosa |
|---|---|
| 2nd row | Ochtodes sp. |
| Value | Count | Frequency (%) |
| colpomenia | 1 | |
| sinuosa | 1 | |
| ochtodes | 1 | |
| sp | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 4 | |
| s | 4 | |
| 2 | 6.7% | |
| p | 2 | 6.7% |
| e | 2 | 6.7% |
| n | 2 | 6.7% |
| i | 2 | 6.7% |
| a | 2 | 6.7% |
| c | 1 | 3.3% |
| d | 1 | 3.3% |
| Other values (8) | 8 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 25 | |
| Space Separator | 2 | 6.7% |
| Uppercase Letter | 2 | 6.7% |
| Other Punctuation | 1 | 3.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 4 | |
| s | 4 | |
| p | 2 | |
| e | 2 | |
| n | 2 | |
| i | 2 | |
| a | 2 | |
| c | 1 | 4.0% |
| d | 1 | 4.0% |
| t | 1 | 4.0% |
| Other values (4) | 4 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 1 | |
| O | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 2 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 27 | |
| Common | 3 | 10.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 4 | |
| s | 4 | |
| p | 2 | 7.4% |
| e | 2 | 7.4% |
| n | 2 | 7.4% |
| i | 2 | 7.4% |
| a | 2 | 7.4% |
| c | 1 | 3.7% |
| d | 1 | 3.7% |
| t | 1 | 3.7% |
| Other values (6) | 6 |
Common
| Value | Count | Frequency (%) |
| 2 | ||
| . | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 30 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 4 | |
| s | 4 | |
| 2 | 6.7% | |
| p | 2 | 6.7% |
| e | 2 | 6.7% |
| n | 2 | 6.7% |
| i | 2 | 6.7% |
| a | 2 | 6.7% |
| c | 1 | 3.3% |
| d | 1 | 3.3% |
| Other values (8) | 8 |
bed
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 3814098 |
| Missing (%) | > 99.9% |
| Memory size | 29.1 MiB |
Length
| Max length | 17 |
|---|---|
| Median length | 17 |
| Mean length | 17 |
| Min length | 17 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Riccardia pinguis |
|---|
| Value | Count | Frequency (%) |
| riccardia | 1 | |
| pinguis | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 4 | |
| c | 2 | |
| a | 2 | |
| R | 1 | 5.9% |
| r | 1 | 5.9% |
| d | 1 | 5.9% |
| 1 | 5.9% | |
| p | 1 | 5.9% |
| n | 1 | 5.9% |
| g | 1 | 5.9% |
| Other values (2) | 2 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 15 | |
| Uppercase Letter | 1 | 5.9% |
| Space Separator | 1 | 5.9% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 4 | |
| c | 2 | |
| a | 2 | |
| r | 1 | 6.7% |
| d | 1 | 6.7% |
| p | 1 | 6.7% |
| n | 1 | 6.7% |
| g | 1 | 6.7% |
| u | 1 | 6.7% |
| s | 1 | 6.7% |
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 16 | |
| Common | 1 | 5.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 4 | |
| c | 2 | |
| a | 2 | |
| R | 1 | 6.2% |
| r | 1 | 6.2% |
| d | 1 | 6.2% |
| p | 1 | 6.2% |
| n | 1 | 6.2% |
| g | 1 | 6.2% |
| u | 1 | 6.2% |
Common
| Value | Count | Frequency (%) |
| 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 17 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 4 | |
| c | 2 | |
| a | 2 | |
| R | 1 | 5.9% |
| r | 1 | 5.9% |
| d | 1 | 5.9% |
| 1 | 5.9% | |
| p | 1 | 5.9% |
| n | 1 | 5.9% |
| g | 1 | 5.9% |
| Other values (2) | 2 |
identificationID
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 3814098 |
| Missing (%) | > 99.9% |
| Memory size | 29.1 MiB |
Length
| Max length | 34 |
|---|---|
| Median length | 34 |
| Mean length | 34 |
| Min length | 34 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Guadalupe Island, Baja California. |
|---|
| Value | Count | Frequency (%) |
| guadalupe | 1 | |
| island | 1 | |
| baja | 1 | |
| california | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 7 | |
| l | 3 | 8.8% |
| 3 | 8.8% | |
| n | 2 | 5.9% |
| d | 2 | 5.9% |
| i | 2 | 5.9% |
| u | 2 | 5.9% |
| j | 1 | 2.9% |
| r | 1 | 2.9% |
| o | 1 | 2.9% |
| Other values (10) | 10 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 25 | |
| Uppercase Letter | 4 | 11.8% |
| Space Separator | 3 | 8.8% |
| Other Punctuation | 2 | 5.9% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 7 | |
| l | 3 | |
| n | 2 | 8.0% |
| d | 2 | 8.0% |
| i | 2 | 8.0% |
| u | 2 | 8.0% |
| j | 1 | 4.0% |
| r | 1 | 4.0% |
| o | 1 | 4.0% |
| f | 1 | 4.0% |
| Other values (3) | 3 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 1 | |
| G | 1 | |
| B | 1 | |
| I | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 1 | |
| . | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 29 | |
| Common | 5 | 14.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 7 | |
| l | 3 | |
| n | 2 | 6.9% |
| d | 2 | 6.9% |
| i | 2 | 6.9% |
| u | 2 | 6.9% |
| j | 1 | 3.4% |
| r | 1 | 3.4% |
| o | 1 | 3.4% |
| f | 1 | 3.4% |
| Other values (7) | 7 |
Common
| Value | Count | Frequency (%) |
| 3 | ||
| , | 1 | 20.0% |
| . | 1 | 20.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 34 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 7 | |
| l | 3 | 8.8% |
| 3 | 8.8% | |
| n | 2 | 5.9% |
| d | 2 | 5.9% |
| i | 2 | 5.9% |
| u | 2 | 5.9% |
| j | 1 | 2.9% |
| r | 1 | 2.9% |
| o | 1 | 2.9% |
| Other values (10) | 10 |
Missing 
| Distinct | 32 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 3799723 |
| Missing (%) | 99.6% |
| Memory size | 29.1 MiB |
Length
| Max length | 64 |
|---|---|
| Median length | 3 |
| Mean length | 4.326377295 |
| Min length | 2 |
Unique
| Unique | 12 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | near |
|---|---|
| 2nd row | cf. |
| 3rd row | cf. |
| 4th row | vel aff. |
| 5th row | vel aff. |
| Value | Count | Frequency (%) |
| cf | 9529 | |
| uncertain | 2623 | 18.0% |
| aff | 1483 | 10.2% |
| near | 410 | 2.8% |
| s.l | 211 | 1.4% |
| vel | 146 | 1.0% |
| group | 45 | 0.3% |
| sp | 38 | 0.3% |
| subgroup | 35 | 0.2% |
| nov | 23 | 0.2% |
| Other values (23) | 53 | 0.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| f | 12495 | |
| c | 12171 | |
| . | 11441 | |
| n | 5696 | |
| a | 4533 | 7.3% |
| e | 3208 | 5.2% |
| r | 3120 | 5.0% |
| u | 2664 | 4.3% |
| t | 2636 | 4.2% |
| i | 2632 | 4.2% |
| Other values (26) | 1600 | 2.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 50426 | |
| Other Punctuation | 11447 | 18.4% |
| Space Separator | 220 | 0.4% |
| Uppercase Letter | 99 | 0.2% |
| Open Punctuation | 2 | < 0.1% |
| Close Punctuation | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| f | 12495 | |
| c | 12171 | |
| n | 5696 | |
| a | 4533 | 9.0% |
| e | 3208 | 6.4% |
| r | 3120 | 6.2% |
| u | 2664 | 5.3% |
| t | 2636 | 5.2% |
| i | 2632 | 5.2% |
| l | 376 | 0.7% |
| Other values (12) | 895 | 1.8% |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 81 | |
| C | 5 | 5.1% |
| D | 3 | 3.0% |
| A | 2 | 2.0% |
| B | 2 | 2.0% |
| S | 2 | 2.0% |
| L | 2 | 2.0% |
| P | 1 | 1.0% |
| N | 1 | 1.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 11441 | |
| , | 6 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 220 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 2 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 50525 | |
| Common | 11671 | 18.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| f | 12495 | |
| c | 12171 | |
| n | 5696 | |
| a | 4533 | 9.0% |
| e | 3208 | 6.3% |
| r | 3120 | 6.2% |
| u | 2664 | 5.3% |
| t | 2636 | 5.2% |
| i | 2632 | 5.2% |
| l | 376 | 0.7% |
| Other values (21) | 994 | 2.0% |
Common
| Value | Count | Frequency (%) |
| . | 11441 | |
| 220 | 1.9% | |
| , | 6 | 0.1% |
| ( | 2 | < 0.1% |
| ) | 2 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 62196 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| f | 12495 | |
| c | 12171 | |
| . | 11441 | |
| n | 5696 | |
| a | 4533 | 7.3% |
| e | 3208 | 5.2% |
| r | 3120 | 5.0% |
| u | 2664 | 4.3% |
| t | 2636 | 4.2% |
| i | 2632 | 4.2% |
| Other values (26) | 1600 | 2.6% |
typeStatus
Text
Missing 
| Distinct | 254 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 3664511 |
| Missing (%) | 96.1% |
| Memory size | 29.1 MiB |
Length
| Max length | 60 |
|---|---|
| Median length | 8 |
| Mean length | 7.885899938 |
| Min length | 1 |
Unique
| Unique | 108 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Type |
|---|---|
| 2nd row | Holotype |
| 3rd row | Type |
| 4th row | Holotype |
| 5th row | Holotype |
| Value | Count | Frequency (%) |
| holotype | 43277 | |
| paratype | 31271 | |
| type | 25920 | |
| isotype | 25475 | |
| syntype | 13189 | 8.2% |
| collection | 3918 | 2.4% |
| lectotype | 3353 | 2.1% |
| isosyntype | 2798 | 1.7% |
| fragment | 2239 | 1.4% |
| allotype | 1694 | 1.1% |
| Other values (55) | 7712 | 4.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| y | 168855 | |
| e | 165218 | |
| p | 151601 | |
| t | 138010 | |
| o | 134667 | |
| a | 69582 | 5.9% |
| l | 58287 | 4.9% |
| H | 43384 | 3.7% |
| r | 37775 | 3.2% |
| s | 34976 | 3.0% |
| Other values (35) | 177281 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1006069 | |
| Uppercase Letter | 160404 | 13.6% |
| Space Separator | 11258 | 1.0% |
| Other Punctuation | 1464 | 0.1% |
| Math Symbol | 437 | < 0.1% |
| Open Punctuation | 2 | < 0.1% |
| Close Punctuation | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| y | 168855 | |
| e | 165218 | |
| p | 151601 | |
| t | 138010 | |
| o | 134667 | |
| a | 69582 | |
| l | 58287 | 5.8% |
| r | 37775 | 3.8% |
| s | 34976 | 3.5% |
| n | 23037 | 2.3% |
| Other values (11) | 24061 | 2.4% |
Uppercase Letter
| Value | Count | Frequency (%) |
| H | 43384 | |
| P | 34860 | |
| I | 29652 | |
| T | 25922 | |
| S | 13362 | 8.3% |
| C | 4936 | 3.1% |
| L | 3359 | 2.1% |
| F | 2239 | 1.4% |
| A | 1697 | 1.1% |
| N | 452 | 0.3% |
| Other values (7) | 541 | 0.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 1428 | |
| ? | 34 | 2.3% |
| . | 2 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 11258 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 437 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 2 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1166473 | |
| Common | 13163 | 1.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| y | 168855 | |
| e | 165218 | |
| p | 151601 | |
| t | 138010 | |
| o | 134667 | |
| a | 69582 | |
| l | 58287 | 5.0% |
| H | 43384 | 3.7% |
| r | 37775 | 3.2% |
| s | 34976 | 3.0% |
| Other values (28) | 164118 |
Common
| Value | Count | Frequency (%) |
| 11258 | ||
| ; | 1428 | 10.8% |
| + | 437 | 3.3% |
| ? | 34 | 0.3% |
| ( | 2 | < 0.1% |
| ) | 2 | < 0.1% |
| . | 2 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1179636 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| y | 168855 | |
| e | 165218 | |
| p | 151601 | |
| t | 138010 | |
| o | 134667 | |
| a | 69582 | 5.9% |
| l | 58287 | 4.9% |
| H | 43384 | 3.7% |
| r | 37775 | 3.2% |
| s | 34976 | 3.0% |
| Other values (35) | 177281 |
identifiedBy
Text
Missing 
| Distinct | 18525 |
|---|---|
| Distinct (%) | 2.8% |
| Missing | 3157857 |
| Missing (%) | 82.8% |
| Memory size | 29.1 MiB |
Length
| Max length | 226 |
|---|---|
| Median length | 141 |
| Mean length | 36.93504987 |
| Min length | 2 |
Unique
| Unique | 6662 ? |
|---|---|
| Unique (%) | 1.0% |
Sample
| 1st row | Badley, J. E. |
|---|---|
| 2nd row | Strong, M. T., (US), Smithsonian Institution - National Museum of Natural History (UNITED STATES) |
| 3rd row | Johnson, M. W. |
| 4th row | Zibrowius, Helmut, (CNRS-UA 41), Centre d'Oceanologie de Marseille (CNRS-UA 41) (FRANCE) |
| 5th row | Foster, W. D. |
| Value | Count | Frequency (%) |
| of | 165315 | 4.6% |
| museum | 142804 | 3.9% |
| national | 141794 | 3.9% |
| institution | 137456 | 3.8% |
| smithsonian | 136494 | 3.8% |
| natural | 136113 | 3.8% |
| history | 135920 | 3.8% |
| united | 123630 | 3.4% |
| states | 123300 | 3.4% |
| 98263 | 2.7% | |
| Other values (13036) | 2275712 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2960559 | 12.2% | |
| a | 1451424 | 6.0% |
| t | 1441525 | 5.9% |
| i | 1422745 | 5.9% |
| n | 1327574 | 5.5% |
| o | 1302984 | 5.4% |
| e | 1067690 | 4.4% |
| , | 1034400 | 4.3% |
| r | 1025293 | 4.2% |
| s | 943871 | 3.9% |
| Other values (99) | 10260266 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 13847813 | |
| Uppercase Letter | 4898288 | 20.2% |
| Space Separator | 2960559 | 12.2% |
| Other Punctuation | 1906878 | 7.9% |
| Open Punctuation | 252924 | 1.0% |
| Close Punctuation | 252924 | 1.0% |
| Dash Punctuation | 116493 | 0.5% |
| Decimal Number | 2356 | < 0.1% |
| Math Symbol | 39 | < 0.1% |
| Initial Punctuation | 28 | < 0.1% |
| Other values (2) | 29 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1451424 | |
| t | 1441525 | |
| i | 1422745 | |
| n | 1327574 | |
| o | 1302984 | |
| e | 1067690 | |
| r | 1025293 | |
| s | 943871 | 6.8% |
| u | 787715 | 5.7% |
| l | 695948 | 5.0% |
| Other values (42) | 2381044 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 576195 | |
| T | 484786 | 9.9% |
| N | 467905 | 9.6% |
| E | 358991 | 7.3% |
| M | 338951 | 6.9% |
| I | 330825 | 6.8% |
| A | 282381 | 5.8% |
| H | 279825 | 5.7% |
| D | 247568 | 5.1% |
| U | 197622 | 4.0% |
| Other values (21) | 1333239 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 1034400 | |
| . | 822681 | |
| ; | 37495 | 2.0% |
| / | 6890 | 0.4% |
| & | 2690 | 0.1% |
| ' | 2180 | 0.1% |
| " | 522 | < 0.1% |
| ¡ | 14 | < 0.1% |
| ? | 6 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1102 | |
| 4 | 1101 | |
| 2 | 59 | 2.5% |
| 0 | 34 | 1.4% |
| 9 | 31 | 1.3% |
| 6 | 29 | 1.2% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 251996 | |
| [ | 928 | 0.4% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 251996 | |
| ] | 928 | 0.4% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 116489 | |
| – | 4 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 2960559 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 39 |
Initial Punctuation
| Value | Count | Frequency (%) |
| “ | 28 |
Final Punctuation
| Value | Count | Frequency (%) |
| ” | 28 |
Currency Symbol
| Value | Count | Frequency (%) |
| ¢ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 18746101 | |
| Common | 5492230 | 22.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 1451424 | 7.7% |
| t | 1441525 | 7.7% |
| i | 1422745 | 7.6% |
| n | 1327574 | 7.1% |
| o | 1302984 | 7.0% |
| e | 1067690 | 5.7% |
| r | 1025293 | 5.5% |
| s | 943871 | 5.0% |
| u | 787715 | 4.2% |
| l | 695948 | 3.7% |
| Other values (73) | 7279332 |
Common
| Value | Count | Frequency (%) |
| 2960559 | ||
| , | 1034400 | 18.8% |
| . | 822681 | 15.0% |
| ( | 251996 | 4.6% |
| ) | 251996 | 4.6% |
| - | 116489 | 2.1% |
| ; | 37495 | 0.7% |
| / | 6890 | 0.1% |
| & | 2690 | < 0.1% |
| ' | 2180 | < 0.1% |
| Other values (16) | 4854 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 24222691 | |
| None | 15580 | 0.1% |
| Punctuation | 60 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2960559 | 12.2% | |
| a | 1451424 | 6.0% |
| t | 1441525 | 6.0% |
| i | 1422745 | 5.9% |
| n | 1327574 | 5.5% |
| o | 1302984 | 5.4% |
| e | 1067690 | 4.4% |
| , | 1034400 | 4.3% |
| r | 1025293 | 4.2% |
| s | 943871 | 3.9% |
| Other values (63) | 10244626 |
None
| Value | Count | Frequency (%) |
| í | 8096 | |
| é | 1950 | 12.5% |
| á | 1861 | 11.9% |
| ñ | 771 | 4.9% |
| ö | 715 | 4.6% |
| ü | 490 | 3.1% |
| ó | 443 | 2.8% |
| ä | 332 | 2.1% |
| ã | 286 | 1.8% |
| ú | 135 | 0.9% |
| Other values (23) | 501 | 3.2% |
Punctuation
| Value | Count | Frequency (%) |
| “ | 28 | |
| ” | 28 | |
| – | 4 | 6.7% |
identifiedByID
Text
Missing 
| Distinct | 7 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 3814092 |
| Missing (%) | > 99.9% |
| Memory size | 29.1 MiB |
Length
| Max length | 69 |
|---|---|
| Median length | 7 |
| Mean length | 25.28571429 |
| Min length | 7 |
Unique
| Unique | 7 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 37.7749 |
|---|---|
| 2nd row | Chromista, Ochrophyta, Phaeophyceae, Ectocarpales, Scytosiphonaceae |
| 3rd row | 34.7745 |
| 4th row | Dicotyledonae |
| 5th row | 59.4381 |
| Value | Count | Frequency (%) |
| 37.7749 | 1 | 6.7% |
| chromista | 1 | 6.7% |
| ochrophyta | 1 | 6.7% |
| phaeophyceae | 1 | 6.7% |
| ectocarpales | 1 | 6.7% |
| scytosiphonaceae | 1 | 6.7% |
| 34.7745 | 1 | 6.7% |
| dicotyledonae | 1 | 6.7% |
| 59.4381 | 1 | 6.7% |
| 41.5265 | 1 | 6.7% |
| Other values (5) | 5 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 17 | 9.6% |
| e | 15 | 8.5% |
| o | 13 | 7.3% |
| h | 11 | 6.2% |
| c | 9 | 5.1% |
| i | 8 | 4.5% |
| t | 8 | 4.5% |
| 8 | 4.5% | |
| , | 8 | 4.5% |
| p | 7 | 4.0% |
| Other values (28) | 73 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 122 | |
| Decimal Number | 24 | 13.6% |
| Other Punctuation | 12 | 6.8% |
| Uppercase Letter | 11 | 6.2% |
| Space Separator | 8 | 4.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 17 | |
| e | 15 | |
| o | 13 | |
| h | 11 | |
| c | 9 | |
| i | 8 | 6.6% |
| t | 8 | 6.6% |
| p | 7 | 5.7% |
| l | 7 | 5.7% |
| y | 7 | 5.7% |
| Other values (7) | 20 |
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 5 | |
| 7 | 5 | |
| 5 | 4 | |
| 3 | 3 | |
| 1 | 2 | 8.3% |
| 9 | 2 | 8.3% |
| 6 | 1 | 4.2% |
| 2 | 1 | 4.2% |
| 8 | 1 | 4.2% |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 2 | |
| R | 2 | |
| G | 1 | |
| F | 1 | |
| E | 1 | |
| D | 1 | |
| C | 1 | |
| S | 1 | |
| O | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 8 | |
| . | 4 |
Space Separator
| Value | Count | Frequency (%) |
| 8 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 133 | |
| Common | 44 | 24.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 17 | |
| e | 15 | |
| o | 13 | |
| h | 11 | 8.3% |
| c | 9 | 6.8% |
| i | 8 | 6.0% |
| t | 8 | 6.0% |
| p | 7 | 5.3% |
| l | 7 | 5.3% |
| y | 7 | 5.3% |
| Other values (16) | 31 |
Common
| Value | Count | Frequency (%) |
| 8 | ||
| , | 8 | |
| 4 | 5 | |
| 7 | 5 | |
| 5 | 4 | |
| . | 4 | |
| 3 | 3 | 6.8% |
| 1 | 2 | 4.5% |
| 9 | 2 | 4.5% |
| 6 | 1 | 2.3% |
| Other values (2) | 2 | 4.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 177 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 17 | 9.6% |
| e | 15 | 8.5% |
| o | 13 | 7.3% |
| h | 11 | 6.2% |
| c | 9 | 5.1% |
| i | 8 | 4.5% |
| t | 8 | 4.5% |
| 8 | 4.5% | |
| , | 8 | 4.5% |
| p | 7 | 4.0% |
| Other values (28) | 73 |
dateIdentified
Text
Missing 
| Distinct | 9 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 3814090 |
| Missing (%) | > 99.9% |
| Memory size | 29.1 MiB |
Length
| Max length | 69 |
|---|---|
| Median length | 18 |
| Mean length | 16 |
| Min length | 7 |
Unique
| Unique | 9 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | -122.419 |
|---|---|
| 2nd row | Chromista |
| 3rd row | -96.6783 |
| 4th row | Asterales |
| 5th row | -151.711 |
| Value | Count | Frequency (%) |
| plantae | 2 | |
| 122.419 | 1 | 7.1% |
| chromista | 1 | 7.1% |
| 96.6783 | 1 | 7.1% |
| asterales | 1 | 7.1% |
| 151.711 | 1 | 7.1% |
| guatteria | 1 | 7.1% |
| punctata | 1 | 7.1% |
| 70.6731 | 1 | 7.1% |
| marchantiophyta | 1 | 7.1% |
| Other values (3) | 3 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 18 | 12.5% |
| e | 12 | 8.3% |
| t | 11 | 7.6% |
| n | 8 | 5.6% |
| r | 7 | 4.9% |
| 1 | 7 | 4.9% |
| i | 6 | 4.2% |
| s | 5 | 3.5% |
| 5 | 3.5% | |
| u | 4 | 2.8% |
| Other values (28) | 61 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 94 | |
| Decimal Number | 24 | 16.7% |
| Uppercase Letter | 9 | 6.2% |
| Other Punctuation | 8 | 5.6% |
| Space Separator | 5 | 3.5% |
| Dash Punctuation | 4 | 2.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 18 | |
| e | 12 | |
| t | 11 | |
| n | 8 | |
| r | 7 | 7.4% |
| i | 6 | 6.4% |
| s | 5 | 5.3% |
| u | 4 | 4.3% |
| l | 4 | 4.3% |
| o | 3 | 3.2% |
| Other values (8) | 16 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 7 | |
| 7 | 4 | |
| 6 | 3 | |
| 3 | 2 | 8.3% |
| 9 | 2 | 8.3% |
| 2 | 2 | 8.3% |
| 5 | 1 | 4.2% |
| 8 | 1 | 4.2% |
| 0 | 1 | 4.2% |
| 4 | 1 | 4.2% |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 2 | |
| M | 2 | |
| A | 2 | |
| G | 1 | |
| C | 1 | |
| J | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 4 | |
| . | 4 |
Space Separator
| Value | Count | Frequency (%) |
| 5 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 4 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 103 | |
| Common | 41 | 28.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 18 | |
| e | 12 | |
| t | 11 | |
| n | 8 | 7.8% |
| r | 7 | 6.8% |
| i | 6 | 5.8% |
| s | 5 | 4.9% |
| u | 4 | 3.9% |
| l | 4 | 3.9% |
| o | 3 | 2.9% |
| Other values (14) | 25 |
Common
| Value | Count | Frequency (%) |
| 1 | 7 | |
| 5 | ||
| , | 4 | |
| 7 | 4 | |
| . | 4 | |
| - | 4 | |
| 6 | 3 | |
| 3 | 2 | 4.9% |
| 9 | 2 | 4.9% |
| 2 | 2 | 4.9% |
| Other values (4) | 4 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 144 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 18 | 12.5% |
| e | 12 | 8.3% |
| t | 11 | 7.6% |
| n | 8 | 5.6% |
| r | 7 | 4.9% |
| 1 | 7 | 4.9% |
| i | 6 | 4.2% |
| s | 5 | 3.5% |
| 5 | 3.5% | |
| u | 4 | 2.8% |
| Other values (28) | 61 |
Missing 
| Distinct | 5 |
|---|---|
| Distinct (%) | 83.3% |
| Missing | 3814093 |
| Missing (%) | > 99.9% |
| Memory size | 29.1 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 10 |
| Mean length | 8.333333333 |
| Min length | 5 |
Unique
| Unique | 4 ? |
|---|---|
| Unique (%) | 66.7% |
Sample
| 1st row | Ochrophyta |
|---|---|
| 2nd row | WGS84 |
| 3rd row | WGS84 |
| 4th row | Rhodophyta |
| 5th row | United States |
| Value | Count | Frequency (%) |
| wgs84 | 2 | |
| ochrophyta | 1 | |
| rhodophyta | 1 | |
| united | 1 | |
| states | 1 | |
| plantae | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 6 | 12.0% |
| a | 5 | 10.0% |
| h | 4 | 8.0% |
| S | 3 | 6.0% |
| e | 3 | 6.0% |
| o | 3 | 6.0% |
| y | 2 | 4.0% |
| n | 2 | 4.0% |
| d | 2 | 4.0% |
| G | 2 | 4.0% |
| Other values (14) | 18 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 34 | |
| Uppercase Letter | 11 | 22.0% |
| Decimal Number | 4 | 8.0% |
| Space Separator | 1 | 2.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 6 | |
| a | 5 | |
| h | 4 | |
| e | 3 | |
| o | 3 | |
| y | 2 | 5.9% |
| n | 2 | 5.9% |
| d | 2 | 5.9% |
| p | 2 | 5.9% |
| r | 1 | 2.9% |
| Other values (4) | 4 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 3 | |
| G | 2 | |
| W | 2 | |
| R | 1 | 9.1% |
| O | 1 | 9.1% |
| U | 1 | 9.1% |
| P | 1 | 9.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 2 | |
| 8 | 2 |
Space Separator
| Value | Count | Frequency (%) |
| 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 45 | |
| Common | 5 | 10.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 6 | |
| a | 5 | 11.1% |
| h | 4 | 8.9% |
| S | 3 | 6.7% |
| e | 3 | 6.7% |
| o | 3 | 6.7% |
| y | 2 | 4.4% |
| n | 2 | 4.4% |
| d | 2 | 4.4% |
| G | 2 | 4.4% |
| Other values (11) | 13 |
Common
| Value | Count | Frequency (%) |
| 4 | 2 | |
| 8 | 2 | |
| 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 50 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 6 | 12.0% |
| a | 5 | 10.0% |
| h | 4 | 8.0% |
| S | 3 | 6.0% |
| e | 3 | 6.0% |
| o | 3 | 6.0% |
| y | 2 | 4.0% |
| n | 2 | 4.0% |
| d | 2 | 4.0% |
| G | 2 | 4.0% |
| Other values (14) | 18 |
identificationVerificationStatus
Text
Missing 
| Distinct | 4 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 3814095 |
| Missing (%) | > 99.9% |
| Memory size | 29.1 MiB |
Length
| Max length | 15 |
|---|---|
| Median length | 14 |
| Mean length | 13.75 |
| Min length | 12 |
Unique
| Unique | 4 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Phaeophyceae |
|---|---|
| 2nd row | Campanulaceae |
| 3rd row | Florideophyceae |
| 4th row | Marchantiophyta |
| Value | Count | Frequency (%) |
| phaeophyceae | 1 | |
| campanulaceae | 1 | |
| florideophyceae | 1 | |
| marchantiophyta | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 10 | |
| e | 8 | |
| h | 5 | |
| o | 4 | 7.3% |
| p | 4 | 7.3% |
| c | 4 | 7.3% |
| y | 3 | 5.5% |
| t | 2 | 3.6% |
| r | 2 | 3.6% |
| n | 2 | 3.6% |
| Other values (9) | 11 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 51 | |
| Uppercase Letter | 4 | 7.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 10 | |
| e | 8 | |
| h | 5 | |
| o | 4 | 7.8% |
| p | 4 | 7.8% |
| c | 4 | 7.8% |
| y | 3 | 5.9% |
| t | 2 | 3.9% |
| r | 2 | 3.9% |
| n | 2 | 3.9% |
| Other values (5) | 7 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 1 | |
| P | 1 | |
| F | 1 | |
| C | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 55 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 10 | |
| e | 8 | |
| h | 5 | |
| o | 4 | 7.3% |
| p | 4 | 7.3% |
| c | 4 | 7.3% |
| y | 3 | 5.5% |
| t | 2 | 3.6% |
| r | 2 | 3.6% |
| n | 2 | 3.6% |
| Other values (9) | 11 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 55 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 10 | |
| e | 8 | |
| h | 5 | |
| o | 4 | 7.3% |
| p | 4 | 7.3% |
| c | 4 | 7.3% |
| y | 3 | 5.5% |
| t | 2 | 3.6% |
| r | 2 | 3.6% |
| n | 2 | 3.6% |
| Other values (9) | 11 |
Missing 
| Distinct | 4 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 3814095 |
| Missing (%) | > 99.9% |
| Memory size | 29.1 MiB |
Length
| Max length | 17 |
|---|---|
| Median length | 14.5 |
| Mean length | 12.5 |
| Min length | 9 |
Unique
| Unique | 4 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Ectocarpales |
|---|---|
| 2nd row | Gigartinales |
| 3rd row | Louisiana |
| 4th row | Jungermanniopsida |
| Value | Count | Frequency (%) |
| ectocarpales | 1 | |
| gigartinales | 1 | |
| louisiana | 1 | |
| jungermanniopsida | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 8 | |
| i | 6 | |
| n | 5 | |
| s | 4 | 8.0% |
| o | 3 | 6.0% |
| r | 3 | 6.0% |
| e | 3 | 6.0% |
| u | 2 | 4.0% |
| t | 2 | 4.0% |
| p | 2 | 4.0% |
| Other values (9) | 12 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 46 | |
| Uppercase Letter | 4 | 8.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 8 | |
| i | 6 | |
| n | 5 | |
| s | 4 | |
| o | 3 | 6.5% |
| r | 3 | 6.5% |
| e | 3 | 6.5% |
| u | 2 | 4.3% |
| t | 2 | 4.3% |
| p | 2 | 4.3% |
| Other values (5) | 8 |
Uppercase Letter
| Value | Count | Frequency (%) |
| J | 1 | |
| E | 1 | |
| L | 1 | |
| G | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 50 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 8 | |
| i | 6 | |
| n | 5 | |
| s | 4 | 8.0% |
| o | 3 | 6.0% |
| r | 3 | 6.0% |
| e | 3 | 6.0% |
| u | 2 | 4.0% |
| t | 2 | 4.0% |
| p | 2 | 4.0% |
| Other values (9) | 12 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 50 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 8 | |
| i | 6 | |
| n | 5 | |
| s | 4 | 8.0% |
| o | 3 | 6.0% |
| r | 3 | 6.0% |
| e | 3 | 6.0% |
| u | 2 | 4.0% |
| t | 2 | 4.0% |
| p | 2 | 4.0% |
| Other values (9) | 12 |
taxonID
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 3814098 |
| Missing (%) | > 99.9% |
| Memory size | 29.1 MiB |
Length
| Max length | 12 |
|---|---|
| Median length | 12 |
| Mean length | 12 |
| Min length | 12 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Metzgeriales |
|---|
| Value | Count | Frequency (%) |
| metzgeriales | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 3 | |
| M | 1 | 8.3% |
| t | 1 | 8.3% |
| z | 1 | 8.3% |
| g | 1 | 8.3% |
| r | 1 | 8.3% |
| i | 1 | 8.3% |
| a | 1 | 8.3% |
| l | 1 | 8.3% |
| s | 1 | 8.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 11 | |
| Uppercase Letter | 1 | 8.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 3 | |
| t | 1 | 9.1% |
| z | 1 | 9.1% |
| g | 1 | 9.1% |
| r | 1 | 9.1% |
| i | 1 | 9.1% |
| a | 1 | 9.1% |
| l | 1 | 9.1% |
| s | 1 | 9.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 12 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 3 | |
| M | 1 | 8.3% |
| t | 1 | 8.3% |
| z | 1 | 8.3% |
| g | 1 | 8.3% |
| r | 1 | 8.3% |
| i | 1 | 8.3% |
| a | 1 | 8.3% |
| l | 1 | 8.3% |
| s | 1 | 8.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 12 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 3 | |
| M | 1 | 8.3% |
| t | 1 | 8.3% |
| z | 1 | 8.3% |
| g | 1 | 8.3% |
| r | 1 | 8.3% |
| i | 1 | 8.3% |
| a | 1 | 8.3% |
| l | 1 | 8.3% |
| s | 1 | 8.3% |
scientificNameID
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 3814097 |
| Missing (%) | > 99.9% |
| Memory size | 29.1 MiB |
Length
| Max length | 17 |
|---|---|
| Median length | 16.5 |
| Mean length | 16.5 |
| Min length | 16 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Scytosiphonaceae |
|---|---|
| 2nd row | Rhizophyllidaceae |
| Value | Count | Frequency (%) |
| scytosiphonaceae | 1 | |
| rhizophyllidaceae | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 4 | |
| e | 4 | |
| c | 3 | |
| o | 3 | |
| i | 3 | |
| h | 3 | |
| y | 2 | 6.1% |
| p | 2 | 6.1% |
| l | 2 | 6.1% |
| S | 1 | 3.0% |
| Other values (6) | 6 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 31 | |
| Uppercase Letter | 2 | 6.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 4 | |
| e | 4 | |
| c | 3 | |
| o | 3 | |
| i | 3 | |
| h | 3 | |
| y | 2 | |
| p | 2 | |
| l | 2 | |
| t | 1 | 3.2% |
| Other values (4) | 4 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 1 | |
| R | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 33 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 4 | |
| e | 4 | |
| c | 3 | |
| o | 3 | |
| i | 3 | |
| h | 3 | |
| y | 2 | 6.1% |
| p | 2 | 6.1% |
| l | 2 | 6.1% |
| S | 1 | 3.0% |
| Other values (6) | 6 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 33 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 4 | |
| e | 4 | |
| c | 3 | |
| o | 3 | |
| i | 3 | |
| h | 3 | |
| y | 2 | 6.1% |
| p | 2 | 6.1% |
| l | 2 | 6.1% |
| S | 1 | 3.0% |
| Other values (6) | 6 |
Missing 
| Distinct | 3 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 3814096 |
| Missing (%) | > 99.9% |
| Memory size | 29.1 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 9 |
| Mean length | 9 |
| Min length | 8 |
Unique
| Unique | 3 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Campanula |
|---|---|
| 2nd row | Raceland |
| 3rd row | Aneuraceae |
| Value | Count | Frequency (%) |
| campanula | 1 | |
| raceland | 1 | |
| aneuraceae | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 7 | |
| e | 4 | |
| n | 3 | |
| u | 2 | 7.4% |
| l | 2 | 7.4% |
| c | 2 | 7.4% |
| C | 1 | 3.7% |
| m | 1 | 3.7% |
| p | 1 | 3.7% |
| R | 1 | 3.7% |
| Other values (3) | 3 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 24 | |
| Uppercase Letter | 3 | 11.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 7 | |
| e | 4 | |
| n | 3 | |
| u | 2 | 8.3% |
| l | 2 | 8.3% |
| c | 2 | 8.3% |
| m | 1 | 4.2% |
| p | 1 | 4.2% |
| d | 1 | 4.2% |
| r | 1 | 4.2% |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 1 | |
| R | 1 | |
| A | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 27 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 7 | |
| e | 4 | |
| n | 3 | |
| u | 2 | 7.4% |
| l | 2 | 7.4% |
| c | 2 | 7.4% |
| C | 1 | 3.7% |
| m | 1 | 3.7% |
| p | 1 | 3.7% |
| R | 1 | 3.7% |
| Other values (3) | 3 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 27 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 7 | |
| e | 4 | |
| n | 3 | |
| u | 2 | 7.4% |
| l | 2 | 7.4% |
| c | 2 | 7.4% |
| C | 1 | 3.7% |
| m | 1 | 3.7% |
| p | 1 | 3.7% |
| R | 1 | 3.7% |
| Other values (3) | 3 |
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 3814098 |
| Missing (%) | > 99.9% |
| Memory size | 29.1 MiB |
Length
| Max length | 68 |
|---|---|
| Median length | 68 |
| Mean length | 68 |
| Min length | 68 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Plantae, Dicotyledonae (basal), Magnoliales, Annonaceae, Annonoideae |
|---|
| Value | Count | Frequency (%) |
| plantae | 1 | |
| dicotyledonae | 1 | |
| basal | 1 | |
| magnoliales | 1 | |
| annonaceae | 1 | |
| annonoideae | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 10 | |
| n | 9 | |
| e | 8 | |
| o | 6 | |
| 5 | 7.4% | |
| l | 5 | 7.4% |
| , | 4 | 5.9% |
| i | 3 | 4.4% |
| c | 2 | 2.9% |
| s | 2 | 2.9% |
| Other values (11) | 14 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 52 | |
| Space Separator | 5 | 7.4% |
| Uppercase Letter | 5 | 7.4% |
| Other Punctuation | 4 | 5.9% |
| Open Punctuation | 1 | 1.5% |
| Close Punctuation | 1 | 1.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 10 | |
| n | 9 | |
| e | 8 | |
| o | 6 | |
| l | 5 | |
| i | 3 | 5.8% |
| c | 2 | 3.8% |
| s | 2 | 3.8% |
| d | 2 | 3.8% |
| t | 2 | 3.8% |
| Other values (3) | 3 | 5.8% |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 2 | |
| D | 1 | |
| M | 1 | |
| P | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 5 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 4 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 57 | |
| Common | 11 | 16.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 10 | |
| n | 9 | |
| e | 8 | |
| o | 6 | |
| l | 5 | |
| i | 3 | 5.3% |
| c | 2 | 3.5% |
| s | 2 | 3.5% |
| d | 2 | 3.5% |
| A | 2 | 3.5% |
| Other values (7) | 8 |
Common
| Value | Count | Frequency (%) |
| 5 | ||
| , | 4 | |
| ( | 1 | 9.1% |
| ) | 1 | 9.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 68 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 10 | |
| n | 9 | |
| e | 8 | |
| o | 6 | |
| 5 | 7.4% | |
| l | 5 | 7.4% |
| , | 4 | 5.9% |
| i | 3 | 4.4% |
| c | 2 | 2.9% |
| s | 2 | 2.9% |
| Other values (11) | 14 |
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 3814098 |
| Missing (%) | > 99.9% |
| Memory size | 29.1 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Plantae |
|---|
| Value | Count | Frequency (%) |
| plantae | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 2 | |
| P | 1 | |
| l | 1 | |
| n | 1 | |
| t | 1 | |
| e | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 6 | |
| Uppercase Letter | 1 | 14.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 2 | |
| l | 1 | |
| n | 1 | |
| t | 1 | |
| e | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 7 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 2 | |
| P | 1 | |
| l | 1 | |
| n | 1 | |
| t | 1 | |
| e | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 2 | |
| P | 1 | |
| l | 1 | |
| n | 1 | |
| t | 1 | |
| e | 1 |
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 3814097 |
| Missing (%) | > 99.9% |
| Memory size | 29.1 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 9 |
| Mean length | 9 |
| Min length | 8 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Colpomenia |
|---|---|
| 2nd row | Ochtodes |
| Value | Count | Frequency (%) |
| colpomenia | 1 | |
| ochtodes | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 3 | |
| e | 2 | 11.1% |
| C | 1 | 5.6% |
| l | 1 | 5.6% |
| p | 1 | 5.6% |
| m | 1 | 5.6% |
| n | 1 | 5.6% |
| i | 1 | 5.6% |
| a | 1 | 5.6% |
| O | 1 | 5.6% |
| Other values (5) | 5 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 16 | |
| Uppercase Letter | 2 | 11.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 3 | |
| e | 2 | |
| l | 1 | 6.2% |
| p | 1 | 6.2% |
| m | 1 | 6.2% |
| n | 1 | 6.2% |
| i | 1 | 6.2% |
| a | 1 | 6.2% |
| c | 1 | 6.2% |
| h | 1 | 6.2% |
| Other values (3) | 3 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 1 | |
| O | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 18 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 3 | |
| e | 2 | 11.1% |
| C | 1 | 5.6% |
| l | 1 | 5.6% |
| p | 1 | 5.6% |
| m | 1 | 5.6% |
| n | 1 | 5.6% |
| i | 1 | 5.6% |
| a | 1 | 5.6% |
| O | 1 | 5.6% |
| Other values (5) | 5 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 18 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 3 | |
| e | 2 | 11.1% |
| C | 1 | 5.6% |
| l | 1 | 5.6% |
| p | 1 | 5.6% |
| m | 1 | 5.6% |
| n | 1 | 5.6% |
| i | 1 | 5.6% |
| a | 1 | 5.6% |
| O | 1 | 5.6% |
| Other values (5) | 5 |
Missing 
| Distinct | 3 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 3814096 |
| Missing (%) | > 99.9% |
| Memory size | 29.1 MiB |
Length
| Max length | 21 |
|---|---|
| Median length | 12 |
| Mean length | 14 |
| Min length | 9 |
Unique
| Unique | 3 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | rotundifolia |
|---|---|
| 2nd row | Dicotyledonae (basal) |
| 3rd row | Riccardia |
| Value | Count | Frequency (%) |
| rotundifolia | 1 | |
| dicotyledonae | 1 | |
| basal | 1 | |
| riccardia | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 6 | |
| i | 5 | |
| o | 4 | 9.5% |
| d | 3 | 7.1% |
| l | 3 | 7.1% |
| c | 3 | 7.1% |
| r | 2 | 4.8% |
| t | 2 | 4.8% |
| n | 2 | 4.8% |
| e | 2 | 4.8% |
| Other values (10) | 10 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 37 | |
| Uppercase Letter | 2 | 4.8% |
| Open Punctuation | 1 | 2.4% |
| Close Punctuation | 1 | 2.4% |
| Space Separator | 1 | 2.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 6 | |
| i | 5 | |
| o | 4 | |
| d | 3 | |
| l | 3 | |
| c | 3 | |
| r | 2 | 5.4% |
| t | 2 | 5.4% |
| n | 2 | 5.4% |
| e | 2 | 5.4% |
| Other values (5) | 5 |
Uppercase Letter
| Value | Count | Frequency (%) |
| D | 1 | |
| R | 1 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 39 | |
| Common | 3 | 7.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 6 | |
| i | 5 | |
| o | 4 | |
| d | 3 | |
| l | 3 | |
| c | 3 | |
| r | 2 | 5.1% |
| t | 2 | 5.1% |
| n | 2 | 5.1% |
| e | 2 | 5.1% |
| Other values (7) | 7 |
Common
| Value | Count | Frequency (%) |
| ( | 1 | |
| ) | 1 | |
| 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 42 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 6 | |
| i | 5 | |
| o | 4 | 9.5% |
| d | 3 | 7.1% |
| l | 3 | 7.1% |
| c | 3 | 7.1% |
| r | 2 | 4.8% |
| t | 2 | 4.8% |
| n | 2 | 4.8% |
| e | 2 | 4.8% |
| Other values (10) | 10 |
taxonConceptID
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 3814098 |
| Missing (%) | > 99.9% |
| Memory size | 29.1 MiB |
Length
| Max length | 11 |
|---|---|
| Median length | 11 |
| Mean length | 11 |
| Min length | 11 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Magnoliales |
|---|
| Value | Count | Frequency (%) |
| magnoliales | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 2 | |
| l | 2 | |
| M | 1 | |
| g | 1 | |
| n | 1 | |
| o | 1 | |
| i | 1 | |
| e | 1 | |
| s | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 10 | |
| Uppercase Letter | 1 | 9.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 2 | |
| l | 2 | |
| g | 1 | |
| n | 1 | |
| o | 1 | |
| i | 1 | |
| e | 1 | |
| s | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 11 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 2 | |
| l | 2 | |
| M | 1 | |
| g | 1 | |
| n | 1 | |
| o | 1 | |
| i | 1 | |
| e | 1 | |
| s | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 11 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 2 | |
| l | 2 | |
| M | 1 | |
| g | 1 | |
| n | 1 | |
| o | 1 | |
| i | 1 | |
| e | 1 | |
| s | 1 |
scientificName
Text
Missing 
| Distinct | 498139 |
|---|---|
| Distinct (%) | 13.6% |
| Missing | 152724 |
| Missing (%) | 4.0% |
| Memory size | 29.1 MiB |
Length
| Max length | 125 |
|---|---|
| Median length | 97 |
| Mean length | 20.18342371 |
| Min length | 3 |
Unique
| Unique | 254723 ? |
|---|---|
| Unique (%) | 7.0% |
Sample
| 1st row | Lesquerella lescurii |
|---|---|
| 2nd row | Desmognathus ochrophaeus |
| 3rd row | Ninoe kinbergi |
| 4th row | Gomphus adelphus |
| 5th row | Skrjabinoclava catoptrophori |
| Value | Count | Frequency (%) |
| sp | 224114 | 2.8% |
| var | 87003 | 1.1% |
| plethodon | 69434 | 0.9% |
| subsp | 43445 | 0.5% |
| cinereus | 35438 | 0.4% |
| bombus | 28778 | 0.4% |
| carex | 23618 | 0.3% |
| indet | 17121 | 0.2% |
| peromyscus | 16160 | 0.2% |
| desmognathus | 14838 | 0.2% |
| Other values (211783) | 7432205 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 8143176 | 11.0% |
| i | 6697207 | 9.1% |
| s | 5452327 | 7.4% |
| e | 4860883 | 6.6% |
| o | 4602351 | 6.2% |
| r | 4549912 | 6.2% |
| 4330779 | 5.9% | |
| u | 3993086 | 5.4% |
| l | 3969942 | 5.4% |
| n | 3867669 | 5.2% |
| Other values (88) | 23431751 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 65218673 | |
| Space Separator | 4330779 | 5.9% |
| Uppercase Letter | 3761747 | 5.1% |
| Other Punctuation | 400495 | 0.5% |
| Open Punctuation | 87330 | 0.1% |
| Close Punctuation | 87329 | 0.1% |
| Dash Punctuation | 9209 | < 0.1% |
| Decimal Number | 3384 | < 0.1% |
| Connector Punctuation | 120 | < 0.1% |
| Math Symbol | 16 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 8143176 | |
| i | 6697207 | |
| s | 5452327 | 8.4% |
| e | 4860883 | 7.5% |
| o | 4602351 | 7.1% |
| r | 4549912 | 7.0% |
| u | 3993086 | 6.1% |
| l | 3969942 | 6.1% |
| n | 3867669 | 5.9% |
| t | 3451881 | 5.3% |
| Other values (27) | 15630239 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 560260 | |
| C | 497486 | |
| A | 346053 | 9.2% |
| S | 335540 | 8.9% |
| M | 252876 | 6.7% |
| L | 201820 | 5.4% |
| E | 192599 | 5.1% |
| T | 180945 | 4.8% |
| D | 169789 | 4.5% |
| B | 158459 | 4.2% |
| Other values (18) | 865920 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 391387 | |
| " | 3810 | 1.0% |
| , | 2144 | 0.5% |
| ' | 1809 | 0.5% |
| & | 937 | 0.2% |
| ? | 279 | 0.1% |
| / | 108 | < 0.1% |
| # | 18 | < 0.1% |
| ! | 1 | < 0.1% |
| † | 1 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 974 | |
| 1 | 802 | |
| 0 | 734 | |
| 5 | 426 | |
| 9 | 114 | 3.4% |
| 8 | 90 | 2.7% |
| 3 | 84 | 2.5% |
| 7 | 64 | 1.9% |
| 4 | 53 | 1.6% |
| 6 | 43 | 1.3% |
Math Symbol
| Value | Count | Frequency (%) |
| × | 8 | |
| + | 5 | |
| ~ | 2 | 12.5% |
| = | 1 | 6.2% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 87296 | |
| [ | 34 | < 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 87295 | |
| ] | 34 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 4330779 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 9209 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 120 |
Final Punctuation
| Value | Count | Frequency (%) |
| ” | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 68980420 | |
| Common | 4918663 | 6.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 8143176 | |
| i | 6697207 | 9.7% |
| s | 5452327 | 7.9% |
| e | 4860883 | 7.0% |
| o | 4602351 | 6.7% |
| r | 4549912 | 6.6% |
| u | 3993086 | 5.8% |
| l | 3969942 | 5.8% |
| n | 3867669 | 5.6% |
| t | 3451881 | 5.0% |
| Other values (55) | 19391986 |
Common
| Value | Count | Frequency (%) |
| 4330779 | ||
| . | 391387 | 8.0% |
| ( | 87296 | 1.8% |
| ) | 87295 | 1.8% |
| - | 9209 | 0.2% |
| " | 3810 | 0.1% |
| , | 2144 | < 0.1% |
| ' | 1809 | < 0.1% |
| 2 | 974 | < 0.1% |
| & | 937 | < 0.1% |
| Other values (23) | 3023 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 73898592 | |
| None | 489 | < 0.1% |
| Punctuation | 2 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 8143176 | 11.0% |
| i | 6697207 | 9.1% |
| s | 5452327 | 7.4% |
| e | 4860883 | 6.6% |
| o | 4602351 | 6.2% |
| r | 4549912 | 6.2% |
| 4330779 | 5.9% | |
| u | 3993086 | 5.4% |
| l | 3969942 | 5.4% |
| n | 3867669 | 5.2% |
| Other values (72) | 23431260 |
None
| Value | Count | Frequency (%) |
| ë | 292 | |
| ö | 51 | 10.4% |
| á | 45 | 9.2% |
| ü | 40 | 8.2% |
| Á | 20 | 4.1% |
| é | 15 | 3.1% |
| ó | 8 | 1.6% |
| × | 8 | 1.6% |
| É | 4 | 0.8% |
| ñ | 2 | 0.4% |
| Other values (4) | 4 | 0.8% |
Punctuation
| Value | Count | Frequency (%) |
| † | 1 | |
| ” | 1 |
Missing 
| Distinct | 3 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 3814096 |
| Missing (%) | > 99.9% |
| Memory size | 29.1 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 7 |
| Mean length | 6.666666667 |
| Min length | 3 |
Unique
| Unique | 3 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | sinuosa |
|---|---|
| 2nd row | Annonaceae |
| 3rd row | sp. |
| Value | Count | Frequency (%) |
| sinuosa | 1 | |
| annonaceae | 1 | |
| sp | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 4 | |
| s | 3 | |
| a | 3 | |
| o | 2 | |
| e | 2 | |
| i | 1 | 5.0% |
| u | 1 | 5.0% |
| A | 1 | 5.0% |
| c | 1 | 5.0% |
| p | 1 | 5.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 18 | |
| Uppercase Letter | 1 | 5.0% |
| Other Punctuation | 1 | 5.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 4 | |
| s | 3 | |
| a | 3 | |
| o | 2 | |
| e | 2 | |
| i | 1 | 5.6% |
| u | 1 | 5.6% |
| c | 1 | 5.6% |
| p | 1 | 5.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 19 | |
| Common | 1 | 5.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 4 | |
| s | 3 | |
| a | 3 | |
| o | 2 | |
| e | 2 | |
| i | 1 | 5.3% |
| u | 1 | 5.3% |
| A | 1 | 5.3% |
| c | 1 | 5.3% |
| p | 1 | 5.3% |
Common
| Value | Count | Frequency (%) |
| . | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 20 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| n | 4 | |
| s | 3 | |
| a | 3 | |
| o | 2 | |
| e | 2 | |
| i | 1 | 5.0% |
| u | 1 | 5.0% |
| A | 1 | 5.0% |
| c | 1 | 5.0% |
| p | 1 | 5.0% |
parentNameUsage
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 3814098 |
| Missing (%) | > 99.9% |
| Memory size | 29.1 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | pinguis |
|---|
| Value | Count | Frequency (%) |
| pinguis | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 2 | |
| p | 1 | |
| n | 1 | |
| g | 1 | |
| u | 1 | |
| s | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 2 | |
| p | 1 | |
| n | 1 | |
| g | 1 | |
| u | 1 | |
| s | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 7 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 2 | |
| p | 1 | |
| n | 1 | |
| g | 1 | |
| u | 1 | |
| s | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 2 | |
| p | 1 | |
| n | 1 | |
| g | 1 | |
| u | 1 | |
| s | 1 |
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 3814097 |
| Missing (%) | > 99.9% |
| Memory size | 29.1 MiB |
Length
| Max length | 9 |
|---|---|
| Median length | 5.5 |
| Mean length | 5.5 |
| Min length | 2 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | GEOLocate |
|---|---|
| 2nd row | L. |
| Value | Count | Frequency (%) |
| geolocate | 1 | |
| l | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| L | 2 | |
| G | 1 | |
| E | 1 | |
| O | 1 | |
| o | 1 | |
| c | 1 | |
| a | 1 | |
| t | 1 | |
| e | 1 | |
| . | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 5 | |
| Lowercase Letter | 5 | |
| Other Punctuation | 1 | 9.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 1 | |
| c | 1 | |
| a | 1 | |
| t | 1 | |
| e | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| L | 2 | |
| G | 1 | |
| E | 1 | |
| O | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 10 | |
| Common | 1 | 9.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| L | 2 | |
| G | 1 | |
| E | 1 | |
| O | 1 | |
| o | 1 | |
| c | 1 | |
| a | 1 | |
| t | 1 | |
| e | 1 |
Common
| Value | Count | Frequency (%) |
| . | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 11 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| L | 2 | |
| G | 1 | |
| E | 1 | |
| O | 1 | |
| o | 1 | |
| c | 1 | |
| a | 1 | |
| t | 1 | |
| e | 1 | |
| . | 1 |
namePublishedIn
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 3814098 |
| Missing (%) | > 99.9% |
| Memory size | 29.1 MiB |
Length
| Max length | 9 |
|---|---|
| Median length | 9 |
| Mean length | 9 |
| Min length | 9 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Guatteria |
|---|
| Value | Count | Frequency (%) |
| guatteria | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 2 | |
| t | 2 | |
| G | 1 | |
| u | 1 | |
| e | 1 | |
| r | 1 | |
| i | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 8 | |
| Uppercase Letter | 1 | 11.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 2 | |
| t | 2 | |
| u | 1 | |
| e | 1 | |
| r | 1 | |
| i | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| G | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 9 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 2 | |
| t | 2 | |
| G | 1 | |
| u | 1 | |
| e | 1 | |
| r | 1 | |
| i | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 2 | |
| t | 2 | |
| G | 1 | |
| u | 1 | |
| e | 1 | |
| r | 1 | |
| i | 1 |
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 3814098 |
| Missing (%) | > 99.9% |
| Memory size | 29.1 MiB |
Length
| Max length | 34 |
|---|---|
| Median length | 34 |
| Mean length | 34 |
| Min length | 34 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | (K. Mert. ex Roth) Derbes & Solier |
|---|
| Value | Count | Frequency (%) |
| k | 1 | |
| mert | 1 | |
| ex | 1 | |
| roth | 1 | |
| derbes | 1 | |
| 1 | ||
| solier | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 6 | ||
| e | 5 | |
| r | 3 | 8.8% |
| o | 2 | 5.9% |
| . | 2 | 5.9% |
| t | 2 | 5.9% |
| D | 1 | 2.9% |
| l | 1 | 2.9% |
| S | 1 | 2.9% |
| & | 1 | 2.9% |
| Other values (10) | 10 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 18 | |
| Space Separator | 6 | 17.6% |
| Uppercase Letter | 5 | 14.7% |
| Other Punctuation | 3 | 8.8% |
| Open Punctuation | 1 | 2.9% |
| Close Punctuation | 1 | 2.9% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 5 | |
| r | 3 | |
| o | 2 | 11.1% |
| t | 2 | 11.1% |
| l | 1 | 5.6% |
| s | 1 | 5.6% |
| b | 1 | 5.6% |
| h | 1 | 5.6% |
| x | 1 | 5.6% |
| i | 1 | 5.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| D | 1 | |
| S | 1 | |
| K | 1 | |
| R | 1 | |
| M | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 2 | |
| & | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 6 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 23 | |
| Common | 11 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 5 | |
| r | 3 | |
| o | 2 | 8.7% |
| t | 2 | 8.7% |
| D | 1 | 4.3% |
| l | 1 | 4.3% |
| S | 1 | 4.3% |
| s | 1 | 4.3% |
| b | 1 | 4.3% |
| h | 1 | 4.3% |
| Other values (5) | 5 |
Common
| Value | Count | Frequency (%) |
| 6 | ||
| . | 2 | 18.2% |
| & | 1 | 9.1% |
| ( | 1 | 9.1% |
| ) | 1 | 9.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 34 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 6 | ||
| e | 5 | |
| r | 3 | 8.8% |
| o | 2 | 5.9% |
| . | 2 | 5.9% |
| t | 2 | 5.9% |
| D | 1 | 2.9% |
| l | 1 | 2.9% |
| S | 1 | 2.9% |
| & | 1 | 2.9% |
| Other values (10) | 10 |
| Distinct | 10142 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 8025 |
| Missing (%) | 0.2% |
| Memory size | 29.1 MiB |
Length
| Max length | 164 |
|---|---|
| Median length | 148 |
| Mean length | 65.02585814 |
| Min length | 6 |
Unique
| Unique | 1548 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Animalia, Arthropoda, Crustacea, Malacostraca, Eumalacostraca, Eucarida, Decapoda, Pleocyemata, Hippolytidae |
|---|---|
| 2nd row | Plantae, Dicotyledonae, Brassicales, Brassicaceae, Brassicoideae |
| 3rd row | Animalia, Chordata, Vertebrata, Amphibia, Caudata, Plethodontidae |
| 4th row | Animalia, Cnidaria, Anthozoa, Hexacorallia, Scleractinia |
| 5th row | Animalia, Annelida, Polychaeta, Errantia, Eunicida, Lumbrineridae |
| Value | Count | Frequency (%) |
| animalia | 1953363 | 9.1% |
| plantae | 1703265 | 7.9% |
| dicotyledonae | 1061726 | 4.9% |
| chordata | 924299 | 4.3% |
| vertebrata | 915878 | 4.3% |
| arthropoda | 407833 | 1.9% |
| monocotyledonae | 373594 | 1.7% |
| mollusca | 356697 | 1.7% |
| poales | 288260 | 1.3% |
| gastropoda | 251919 | 1.2% |
| Other values (10186) | 13296608 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 34476529 | |
| e | 24748170 | 10.0% |
| i | 18023886 | 7.3% |
| 17727368 | 7.2% | |
| , | 17673090 | 7.1% |
| o | 15704488 | 6.3% |
| t | 13361876 | 5.4% |
| l | 12174850 | 4.9% |
| r | 11460198 | 4.6% |
| n | 10609214 | 4.3% |
| Other values (63) | 71533559 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 190534785 | |
| Uppercase Letter | 21479072 | 8.7% |
| Space Separator | 17727368 | 7.2% |
| Other Punctuation | 17681466 | 7.1% |
| Open Punctuation | 35134 | < 0.1% |
| Close Punctuation | 35134 | < 0.1% |
| Dash Punctuation | 201 | < 0.1% |
| Connector Punctuation | 51 | < 0.1% |
| Decimal Number | 16 | < 0.1% |
| Math Symbol | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 34476529 | |
| e | 24748170 | |
| i | 18023886 | |
| o | 15704488 | |
| t | 13361876 | 7.0% |
| l | 12174850 | 6.4% |
| r | 11460198 | 6.0% |
| n | 10609214 | 5.6% |
| d | 8725587 | 4.6% |
| c | 8272856 | 4.3% |
| Other values (17) | 32977131 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 4453774 | |
| P | 3956585 | |
| C | 2595797 | |
| M | 1813334 | |
| D | 1362205 | 6.3% |
| V | 1021212 | 4.8% |
| E | 864737 | 4.0% |
| S | 853251 | 4.0% |
| L | 556792 | 2.6% |
| R | 552324 | 2.6% |
| Other values (16) | 3449061 |
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 3 | |
| 2 | 2 | |
| 9 | 2 | |
| 7 | 2 | |
| 0 | 2 | |
| 1 | 2 | |
| 3 | 2 | |
| 4 | 1 | 6.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 17673090 | |
| . | 8341 | < 0.1% |
| ? | 24 | < 0.1% |
| / | 11 | < 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 35072 | |
| [ | 62 | 0.2% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 35072 | |
| ] | 62 | 0.2% |
Space Separator
| Value | Count | Frequency (%) |
| 17727368 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 201 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 51 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 212013857 | |
| Common | 35479371 | 14.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 34476529 | |
| e | 24748170 | |
| i | 18023886 | 8.5% |
| o | 15704488 | 7.4% |
| t | 13361876 | 6.3% |
| l | 12174850 | 5.7% |
| r | 11460198 | 5.4% |
| n | 10609214 | 5.0% |
| d | 8725587 | 4.1% |
| c | 8272856 | 3.9% |
| Other values (43) | 54456203 |
Common
| Value | Count | Frequency (%) |
| 17727368 | ||
| , | 17673090 | |
| ( | 35072 | 0.1% |
| ) | 35072 | 0.1% |
| . | 8341 | < 0.1% |
| - | 201 | < 0.1% |
| [ | 62 | < 0.1% |
| ] | 62 | < 0.1% |
| _ | 51 | < 0.1% |
| ? | 24 | < 0.1% |
| Other values (10) | 28 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 247492967 | |
| None | 261 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 34476529 | |
| e | 24748170 | 10.0% |
| i | 18023886 | 7.3% |
| 17727368 | 7.2% | |
| , | 17673090 | 7.1% |
| o | 15704488 | 6.3% |
| t | 13361876 | 5.4% |
| l | 12174850 | 4.9% |
| r | 11460198 | 4.6% |
| n | 10609214 | 4.3% |
| Other values (62) | 71533298 |
None
| Value | Count | Frequency (%) |
| ö | 261 |
kingdom
Text
| Distinct | 16 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 10040 |
| Missing (%) | 0.3% |
| Memory size | 29.1 MiB |
Length
| Max length | 33 |
|---|---|
| Median length | 8 |
| Mean length | 7.495749146 |
| Min length | 5 |
Unique
| Unique | 5 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Animalia |
|---|---|
| 2nd row | Plantae |
| 3rd row | Animalia |
| 4th row | Animalia |
| 5th row | Animalia |
| Value | Count | Frequency (%) |
| animalia | 1953363 | |
| plantae | 1703246 | |
| fungi | 91813 | 2.4% |
| eubacteria | 21587 | 0.6% |
| chromista | 17285 | 0.5% |
| protista | 15845 | 0.4% |
| protozoa | 896 | < 0.1% |
| bacteria | 9 | < 0.1% |
| animalis | 6 | < 0.1% |
| animala | 4 | < 0.1% |
| Other values (7) | 9 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 7390458 | |
| i | 4053285 | |
| n | 3748435 | |
| l | 3656619 | |
| m | 1970659 | 6.9% |
| A | 1953373 | 6.9% |
| t | 1774717 | 6.2% |
| e | 1724848 | 6.0% |
| P | 1719986 | 6.0% |
| u | 113402 | 0.4% |
| Other values (23) | 408490 | 1.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 24710205 | |
| Uppercase Letter | 3804056 | 13.3% |
| Decimal Number | 5 | < 0.1% |
| Space Separator | 4 | < 0.1% |
| Dash Punctuation | 1 | < 0.1% |
| Other Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 7390458 | |
| i | 4053285 | |
| n | 3748435 | |
| l | 3656619 | |
| m | 1970659 | 8.0% |
| t | 1774717 | 7.2% |
| e | 1724848 | 7.0% |
| u | 113402 | 0.5% |
| g | 91814 | 0.4% |
| r | 55628 | 0.2% |
| Other values (10) | 130340 | 0.5% |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 1953373 | |
| P | 1719986 | |
| F | 91813 | 2.4% |
| E | 21589 | 0.6% |
| C | 17285 | 0.5% |
| B | 9 | < 0.1% |
| I | 1 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 9 | 3 | |
| 0 | 1 | 20.0% |
| 5 | 1 | 20.0% |
Space Separator
| Value | Count | Frequency (%) |
| 4 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 28514261 | |
| Common | 11 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 7390458 | |
| i | 4053285 | |
| n | 3748435 | |
| l | 3656619 | |
| m | 1970659 | 6.9% |
| A | 1953373 | 6.9% |
| t | 1774717 | 6.2% |
| e | 1724848 | 6.0% |
| P | 1719986 | 6.0% |
| u | 113402 | 0.4% |
| Other values (17) | 408479 | 1.4% |
Common
| Value | Count | Frequency (%) |
| 4 | ||
| 9 | 3 | |
| - | 1 | 9.1% |
| 0 | 1 | 9.1% |
| . | 1 | 9.1% |
| 5 | 1 | 9.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 28514272 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 7390458 | |
| i | 4053285 | |
| n | 3748435 | |
| l | 3656619 | |
| m | 1970659 | 6.9% |
| A | 1953373 | 6.9% |
| t | 1774717 | 6.2% |
| e | 1724848 | 6.0% |
| P | 1719986 | 6.0% |
| u | 113402 | 0.4% |
| Other values (23) | 408490 | 1.4% |
phylum
Text
Missing 
| Distinct | 106 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1562087 |
| Missing (%) | 41.0% |
| Memory size | 29.1 MiB |
Length
| Max length | 31 |
|---|---|
| Median length | 8 |
| Mean length | 8.845462635 |
| Min length | 5 |
Unique
| Unique | 14 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Arthropoda |
|---|---|
| 2nd row | Chordata |
| 3rd row | Cnidaria |
| 4th row | Annelida |
| 5th row | Arthropoda |
| Value | Count | Frequency (%) |
| chordata | 924299 | |
| arthropoda | 407833 | |
| mollusca | 356697 | 15.8% |
| annelida | 99290 | 4.4% |
| ascomycota | 90632 | 4.0% |
| bryophyta | 61205 | 2.7% |
| rhodophyta | 50004 | 2.2% |
| cnidaria | 48058 | 2.1% |
| echinodermata | 37484 | 1.7% |
| nematoda | 28248 | 1.3% |
| Other values (105) | 149216 | 6.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 3392588 | |
| o | 2654365 | |
| r | 2009507 | |
| t | 1753345 | |
| h | 1694517 | |
| d | 1599243 | |
| C | 1015241 | 5.1% |
| l | 905045 | 4.5% |
| c | 648529 | 3.3% |
| A | 600581 | 3.0% |
| Other values (42) | 3647127 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 17666309 | |
| Uppercase Letter | 2251999 | 11.3% |
| Space Separator | 954 | < 0.1% |
| Other Punctuation | 660 | < 0.1% |
| Dash Punctuation | 117 | < 0.1% |
| Connector Punctuation | 47 | < 0.1% |
| Decimal Number | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 3392588 | |
| o | 2654365 | |
| r | 2009507 | |
| t | 1753345 | |
| h | 1694517 | |
| d | 1599243 | |
| l | 905045 | 5.1% |
| c | 648529 | 3.7% |
| p | 597572 | 3.4% |
| s | 469272 | 2.7% |
| Other values (14) | 1942326 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 1015241 | |
| A | 600581 | |
| M | 370202 | 16.4% |
| B | 79873 | 3.5% |
| R | 50413 | 2.2% |
| P | 43753 | 1.9% |
| E | 37618 | 1.7% |
| N | 30920 | 1.4% |
| O | 12211 | 0.5% |
| S | 4597 | 0.2% |
| Other values (12) | 6590 | 0.3% |
Decimal Number
| Value | Count | Frequency (%) |
| 8 | 1 | |
| 4 | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 954 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 660 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 117 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 47 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 19918308 | |
| Common | 1780 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 3392588 | |
| o | 2654365 | |
| r | 2009507 | |
| t | 1753345 | |
| h | 1694517 | |
| d | 1599243 | |
| C | 1015241 | 5.1% |
| l | 905045 | 4.5% |
| c | 648529 | 3.3% |
| A | 600581 | 3.0% |
| Other values (36) | 3645347 |
Common
| Value | Count | Frequency (%) |
| 954 | ||
| . | 660 | |
| - | 117 | 6.6% |
| _ | 47 | 2.6% |
| 8 | 1 | 0.1% |
| 4 | 1 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 19920088 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 3392588 | |
| o | 2654365 | |
| r | 2009507 | |
| t | 1753345 | |
| h | 1694517 | |
| d | 1599243 | |
| C | 1015241 | 5.1% |
| l | 905045 | 4.5% |
| c | 648529 | 3.3% |
| A | 600581 | 3.0% |
| Other values (42) | 3647127 |
class
Text
Missing 
| Distinct | 225 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 102065 |
| Missing (%) | 2.7% |
| Memory size | 29.1 MiB |
Length
| Max length | 33 |
|---|---|
| Median length | 20 |
| Mean length | 11.06579573 |
| Min length | 4 |
Unique
| Unique | 35 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Malacostraca |
|---|---|
| 2nd row | Dicotyledonae |
| 3rd row | Amphibia |
| 4th row | Anthozoa |
| 5th row | Polychaeta |
| Value | Count | Frequency (%) |
| dicotyledonae | 1061725 | |
| monocotyledonae | 373594 | 10.0% |
| gastropoda | 251919 | 6.7% |
| mammalia | 247286 | 6.6% |
| insecta | 242424 | 6.5% |
| aves | 240577 | 6.4% |
| actinopterygii | 183425 | 4.9% |
| amphibia | 162685 | 4.3% |
| malacostraca | 124107 | 3.3% |
| pteridophyte | 113570 | 3.0% |
| Other values (216) | 746222 |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 5341978 | |
| a | 4658977 | |
| e | 4540797 | |
| t | 3012097 | 7.3% |
| i | 2919628 | 7.1% |
| c | 2508892 | 6.1% |
| n | 2461418 | 6.0% |
| l | 2238371 | 5.4% |
| d | 2073116 | 5.0% |
| y | 2053582 | 5.0% |
| Other values (41) | 9267754 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 37258936 | |
| Uppercase Letter | 3712033 | 9.0% |
| Space Separator | 35500 | 0.1% |
| Open Punctuation | 35023 | 0.1% |
| Close Punctuation | 35023 | 0.1% |
| Other Punctuation | 95 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 5341978 | |
| a | 4658977 | |
| e | 4540797 | |
| t | 3012097 | |
| i | 2919628 | |
| c | 2508892 | |
| n | 2461418 | |
| l | 2238371 | |
| d | 2073116 | 5.6% |
| y | 2053582 | 5.5% |
| Other values (15) | 5450080 |
Uppercase Letter
| Value | Count | Frequency (%) |
| D | 1074606 | |
| M | 770943 | |
| A | 656642 | |
| G | 252003 | 6.8% |
| I | 243001 | 6.5% |
| P | 228128 | 6.1% |
| B | 142983 | 3.9% |
| L | 83972 | 2.3% |
| R | 78213 | 2.1% |
| C | 39792 | 1.1% |
| Other values (12) | 141750 | 3.8% |
Space Separator
| Value | Count | Frequency (%) |
| 35500 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 35023 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 35023 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 95 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 40970969 | |
| Common | 105641 | 0.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 5341978 | |
| a | 4658977 | |
| e | 4540797 | |
| t | 3012097 | 7.4% |
| i | 2919628 | 7.1% |
| c | 2508892 | 6.1% |
| n | 2461418 | 6.0% |
| l | 2238371 | 5.5% |
| d | 2073116 | 5.1% |
| y | 2053582 | 5.0% |
| Other values (37) | 9162113 |
Common
| Value | Count | Frequency (%) |
| 35500 | ||
| ( | 35023 | |
| ) | 35023 | |
| . | 95 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 41076610 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 5341978 | |
| a | 4658977 | |
| e | 4540797 | |
| t | 3012097 | 7.3% |
| i | 2919628 | 7.1% |
| c | 2508892 | 6.1% |
| n | 2461418 | 6.0% |
| l | 2238371 | 5.4% |
| d | 2073116 | 5.0% |
| y | 2053582 | 5.0% |
| Other values (41) | 9267754 |
order
Text
Missing 
| Distinct | 978 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 410734 |
| Missing (%) | 10.8% |
| Memory size | 29.1 MiB |
Length
| Max length | 32 |
|---|---|
| Median length | 22 |
| Mean length | 9.636402502 |
| Min length | 5 |
Unique
| Unique | 97 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Decapoda |
|---|---|
| 2nd row | Brassicales |
| 3rd row | Caudata |
| 4th row | Scleractinia |
| 5th row | Eunicida |
| Value | Count | Frequency (%) |
| poales | 288260 | 8.5% |
| asterales | 156438 | 4.6% |
| passeriformes | 152902 | 4.5% |
| rodentia | 122495 | 3.6% |
| lamiales | 109533 | 3.2% |
| fabales | 104514 | 3.1% |
| caudata | 97702 | 2.9% |
| perciformes | 88214 | 2.6% |
| malpighiales | 86512 | 2.5% |
| decapoda | 80987 | 2.4% |
| Other values (968) | 2116574 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 4889746 | |
| e | 3854264 | |
| s | 3030750 | 9.2% |
| l | 2735926 | 8.3% |
| o | 2274535 | 6.9% |
| i | 2201734 | 6.7% |
| r | 2102567 | 6.4% |
| t | 1244556 | 3.8% |
| n | 1059210 | 3.2% |
| p | 999396 | 3.0% |
| Other values (45) | 8403511 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 29389601 | |
| Uppercase Letter | 3403303 | 10.4% |
| Other Punctuation | 2403 | < 0.1% |
| Space Separator | 766 | < 0.1% |
| Open Punctuation | 61 | < 0.1% |
| Close Punctuation | 61 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 4889746 | |
| e | 3854264 | |
| s | 3030750 | |
| l | 2735926 | |
| o | 2274535 | |
| i | 2201734 | |
| r | 2102567 | |
| t | 1244556 | 4.2% |
| n | 1059210 | 3.6% |
| p | 999396 | 3.4% |
| Other values (16) | 4996917 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 745213 | |
| C | 463002 | |
| A | 376082 | |
| S | 270198 | 7.9% |
| L | 240506 | 7.1% |
| M | 214351 | 6.3% |
| R | 205758 | 6.0% |
| D | 147162 | 4.3% |
| F | 130942 | 3.8% |
| H | 121624 | 3.6% |
| Other values (14) | 488465 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 2402 | |
| ? | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 766 |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 61 |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 61 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 32792904 | |
| Common | 3291 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 4889746 | |
| e | 3854264 | |
| s | 3030750 | 9.2% |
| l | 2735926 | 8.3% |
| o | 2274535 | 6.9% |
| i | 2201734 | 6.7% |
| r | 2102567 | 6.4% |
| t | 1244556 | 3.8% |
| n | 1059210 | 3.2% |
| p | 999396 | 3.0% |
| Other values (40) | 8400220 |
Common
| Value | Count | Frequency (%) |
| . | 2402 | |
| 766 | 23.3% | |
| [ | 61 | 1.9% |
| ] | 61 | 1.9% |
| ? | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 32796195 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 4889746 | |
| e | 3854264 | |
| s | 3030750 | 9.2% |
| l | 2735926 | 8.3% |
| o | 2274535 | 6.9% |
| i | 2201734 | 6.7% |
| r | 2102567 | 6.4% |
| t | 1244556 | 3.8% |
| n | 1059210 | 3.2% |
| p | 999396 | 3.0% |
| Other values (45) | 8403511 |
family
Text
Missing 
| Distinct | 6247 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 101008 |
| Missing (%) | 2.6% |
| Memory size | 29.1 MiB |
Length
| Max length | 38 |
|---|---|
| Median length | 33 |
| Mean length | 10.82154545 |
| Min length | 6 |
Unique
| Unique | 679 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Hippolytidae |
|---|---|
| 2nd row | Brassicaceae |
| 3rd row | Plethodontidae |
| 4th row | Lumbrineridae |
| 5th row | Gomphidae |
| Value | Count | Frequency (%) |
| poaceae | 206776 | 5.6% |
| asteraceae | 147421 | 4.0% |
| fabaceae | 97640 | 2.6% |
| plethodontidae | 91218 | 2.5% |
| cyperaceae | 57015 | 1.5% |
| rubiaceae | 49120 | 1.3% |
| cricetidae | 44315 | 1.2% |
| muridae | 38592 | 1.0% |
| apidae | 34263 | 0.9% |
| melastomataceae | 30107 | 0.8% |
| Other values (6234) | 2925182 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 7167192 | |
| e | 7127326 | |
| i | 3783943 | |
| c | 2767017 | 6.9% |
| d | 2343643 | 5.8% |
| o | 1987878 | 4.9% |
| r | 1899670 | 4.7% |
| l | 1560793 | 3.9% |
| n | 1413847 | 3.5% |
| t | 1394290 | 3.5% |
| Other values (54) | 8735784 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 36454714 | |
| Uppercase Letter | 3713091 | 9.2% |
| Space Separator | 8558 | < 0.1% |
| Other Punctuation | 4923 | < 0.1% |
| Open Punctuation | 41 | < 0.1% |
| Close Punctuation | 41 | < 0.1% |
| Decimal Number | 10 | < 0.1% |
| Connector Punctuation | 4 | < 0.1% |
| Math Symbol | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 7167192 | |
| e | 7127326 | |
| i | 3783943 | |
| c | 2767017 | 7.6% |
| d | 2343643 | 6.4% |
| o | 1987878 | 5.5% |
| r | 1899670 | 5.2% |
| l | 1560793 | 4.3% |
| n | 1413847 | 3.9% |
| t | 1394290 | 3.8% |
| Other values (16) | 5009115 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 736705 | |
| C | 513178 | |
| A | 429262 | |
| S | 276035 | 7.4% |
| M | 228091 | 6.1% |
| L | 172968 | 4.7% |
| T | 149659 | 4.0% |
| R | 149142 | 4.0% |
| F | 139759 | 3.8% |
| E | 139164 | 3.7% |
| Other values (16) | 779128 |
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 3 | |
| 0 | 2 | |
| 1 | 2 | |
| 3 | 2 | |
| 9 | 1 | 10.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 4914 | |
| ? | 9 | 0.2% |
Space Separator
| Value | Count | Frequency (%) |
| 8558 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 41 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 41 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 4 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 40167805 | |
| Common | 13578 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 7167192 | |
| e | 7127326 | |
| i | 3783943 | |
| c | 2767017 | 6.9% |
| d | 2343643 | 5.8% |
| o | 1987878 | 4.9% |
| r | 1899670 | 4.7% |
| l | 1560793 | 3.9% |
| n | 1413847 | 3.5% |
| t | 1394290 | 3.5% |
| Other values (42) | 8722206 |
Common
| Value | Count | Frequency (%) |
| 8558 | ||
| . | 4914 | |
| ( | 41 | 0.3% |
| ) | 41 | 0.3% |
| ? | 9 | 0.1% |
| _ | 4 | < 0.1% |
| 6 | 3 | < 0.1% |
| 0 | 2 | < 0.1% |
| 1 | 2 | < 0.1% |
| 3 | 2 | < 0.1% |
| Other values (2) | 2 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 40181383 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 7167192 | |
| e | 7127326 | |
| i | 3783943 | |
| c | 2767017 | 6.9% |
| d | 2343643 | 5.8% |
| o | 1987878 | 4.9% |
| r | 1899670 | 4.7% |
| l | 1560793 | 3.9% |
| n | 1413847 | 3.5% |
| t | 1394290 | 3.5% |
| Other values (54) | 8735784 |
subfamily
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 3814098 |
| Missing (%) | > 99.9% |
| Memory size | 29.1 MiB |
Length
| Max length | 19 |
|---|---|
| Median length | 19 |
| Mean length | 19 |
| Min length | 19 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | (Aubl.) R.A. Howard |
|---|
| Value | Count | Frequency (%) |
| aubl | 1 | |
| r.a | 1 | |
| howard | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 3 | |
| A | 2 | 10.5% |
| 2 | 10.5% | |
| ( | 1 | 5.3% |
| u | 1 | 5.3% |
| b | 1 | 5.3% |
| l | 1 | 5.3% |
| ) | 1 | 5.3% |
| R | 1 | 5.3% |
| H | 1 | 5.3% |
| Other values (5) | 5 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 8 | |
| Uppercase Letter | 4 | |
| Other Punctuation | 3 | 15.8% |
| Space Separator | 2 | 10.5% |
| Open Punctuation | 1 | 5.3% |
| Close Punctuation | 1 | 5.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| u | 1 | |
| b | 1 | |
| l | 1 | |
| o | 1 | |
| w | 1 | |
| a | 1 | |
| r | 1 | |
| d | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 2 | |
| R | 1 | |
| H | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 3 |
Space Separator
| Value | Count | Frequency (%) |
| 2 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 12 | |
| Common | 7 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 2 | |
| u | 1 | |
| b | 1 | |
| l | 1 | |
| R | 1 | |
| H | 1 | |
| o | 1 | |
| w | 1 | |
| a | 1 | |
| r | 1 |
Common
| Value | Count | Frequency (%) |
| . | 3 | |
| 2 | ||
| ( | 1 | 14.3% |
| ) | 1 | 14.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 19 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 3 | |
| A | 2 | 10.5% |
| 2 | 10.5% | |
| ( | 1 | 5.3% |
| u | 1 | 5.3% |
| b | 1 | 5.3% |
| l | 1 | 5.3% |
| ) | 1 | 5.3% |
| R | 1 | 5.3% |
| H | 1 | 5.3% |
| Other values (5) | 5 |
genus
Text
Missing 
| Distinct | 70442 |
|---|---|
| Distinct (%) | 1.9% |
| Missing | 162837 |
| Missing (%) | 4.3% |
| Memory size | 29.1 MiB |
Length
| Max length | 35 |
|---|---|
| Median length | 25 |
| Mean length | 8.949369834 |
| Min length | 1 |
Unique
| Unique | 20728 ? |
|---|---|
| Unique (%) | 0.6% |
Sample
| 1st row | Lesquerella |
|---|---|
| 2nd row | Desmognathus |
| 3rd row | Ninoe |
| 4th row | Gomphus |
| 5th row | Skrjabinoclava |
| Value | Count | Frequency (%) |
| plethodon | 69419 | 1.9% |
| bombus | 25851 | 0.7% |
| carex | 23618 | 0.6% |
| peromyscus | 16159 | 0.4% |
| desmognathus | 14837 | 0.4% |
| indet | 14188 | 0.4% |
| poa | 12303 | 0.3% |
| cyperus | 11356 | 0.3% |
| cladonia | 11033 | 0.3% |
| paspalum | 10616 | 0.3% |
| Other values (70416) | 3448082 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 3570749 | 10.9% |
| i | 2703577 | 8.3% |
| o | 2622759 | 8.0% |
| e | 2270868 | 6.9% |
| s | 2132346 | 6.5% |
| r | 2080887 | 6.4% |
| l | 1789654 | 5.5% |
| u | 1660949 | 5.1% |
| n | 1617510 | 5.0% |
| t | 1546059 | 4.7% |
| Other values (62) | 10681136 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 29005224 | |
| Uppercase Letter | 3650636 | 11.2% |
| Other Punctuation | 14221 | < 0.1% |
| Space Separator | 6200 | < 0.1% |
| Open Punctuation | 70 | < 0.1% |
| Close Punctuation | 70 | < 0.1% |
| Dash Punctuation | 49 | < 0.1% |
| Decimal Number | 15 | < 0.1% |
| Connector Punctuation | 8 | < 0.1% |
| Final Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 3570749 | |
| i | 2703577 | 9.3% |
| o | 2622759 | 9.0% |
| e | 2270868 | 7.8% |
| s | 2132346 | 7.4% |
| r | 2080887 | 7.2% |
| l | 1789654 | 6.2% |
| u | 1660949 | 5.7% |
| n | 1617510 | 5.6% |
| t | 1546059 | 5.3% |
| Other values (18) | 7009866 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 539981 | |
| C | 485059 | |
| A | 336052 | 9.2% |
| S | 325425 | 8.9% |
| M | 245659 | 6.7% |
| L | 195244 | 5.3% |
| E | 189255 | 5.2% |
| T | 174807 | 4.8% |
| D | 165471 | 4.5% |
| B | 152665 | 4.2% |
| Other values (16) | 841018 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 4 | |
| 3 | 4 | |
| 6 | 3 | |
| 1 | 2 | |
| 4 | 1 | 6.7% |
| 9 | 1 | 6.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 14196 | |
| ? | 20 | 0.1% |
| / | 4 | < 0.1% |
| ! | 1 | < 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 46 | |
| [ | 24 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 46 | |
| ] | 24 |
Space Separator
| Value | Count | Frequency (%) |
| 6200 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 49 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 8 |
Final Punctuation
| Value | Count | Frequency (%) |
| ” | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 32655860 | |
| Common | 20634 | 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 3570749 | 10.9% |
| i | 2703577 | 8.3% |
| o | 2622759 | 8.0% |
| e | 2270868 | 7.0% |
| s | 2132346 | 6.5% |
| r | 2080887 | 6.4% |
| l | 1789654 | 5.5% |
| u | 1660949 | 5.1% |
| n | 1617510 | 5.0% |
| t | 1546059 | 4.7% |
| Other values (44) | 10660502 |
Common
| Value | Count | Frequency (%) |
| . | 14196 | |
| 6200 | ||
| - | 49 | 0.2% |
| ( | 46 | 0.2% |
| ) | 46 | 0.2% |
| [ | 24 | 0.1% |
| ] | 24 | 0.1% |
| ? | 20 | 0.1% |
| _ | 8 | < 0.1% |
| 0 | 4 | < 0.1% |
| Other values (8) | 17 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 32676225 | |
| None | 268 | < 0.1% |
| Punctuation | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 3570749 | 10.9% |
| i | 2703577 | 8.3% |
| o | 2622759 | 8.0% |
| e | 2270868 | 6.9% |
| s | 2132346 | 6.5% |
| r | 2080887 | 6.4% |
| l | 1789654 | 5.5% |
| u | 1660949 | 5.1% |
| n | 1617510 | 5.0% |
| t | 1546059 | 4.7% |
| Other values (59) | 10680867 |
None
| Value | Count | Frequency (%) |
| ë | 264 | |
| ö | 4 | 1.5% |
Punctuation
| Value | Count | Frequency (%) |
| ” | 1 |
subgenus
Text
Missing 
| Distinct | 4536 |
|---|---|
| Distinct (%) | 5.4% |
| Missing | 3729484 |
| Missing (%) | 97.8% |
| Memory size | 29.1 MiB |
Length
| Max length | 21 |
|---|---|
| Median length | 17 |
| Mean length | 10.12022691 |
| Min length | 1 |
Unique
| Unique | 1590 ? |
|---|---|
| Unique (%) | 1.9% |
Sample
| 1st row | Colobostylus |
|---|---|
| 2nd row | Tricyphona |
| 3rd row | Angulus |
| 4th row | Costellaria |
| 5th row | Agathistoma |
| Value | Count | Frequency (%) |
| pyrobombus | 8813 | 10.4% |
| bombus | 2923 | 3.5% |
| apis | 1481 | 1.8% |
| thericium | 1417 | 1.7% |
| fervidobombus | 1384 | 1.6% |
| depressicambarus | 1232 | 1.5% |
| ortmannicus | 1037 | 1.2% |
| stephanoconus | 1008 | 1.2% |
| neoxylocopa | 981 | 1.2% |
| alpinobombus | 647 | 0.8% |
| Other values (4526) | 63703 |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 89127 | 10.4% |
| a | 81161 | 9.5% |
| i | 64482 | 7.5% |
| s | 63807 | 7.5% |
| r | 60943 | 7.1% |
| u | 51272 | 6.0% |
| e | 44787 | 5.2% |
| m | 41692 | 4.9% |
| l | 41013 | 4.8% |
| b | 38453 | 4.5% |
| Other values (45) | 279586 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 771663 | |
| Uppercase Letter | 84615 | 9.9% |
| Other Punctuation | 34 | < 0.1% |
| Space Separator | 11 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 89127 | |
| a | 81161 | |
| i | 64482 | 8.4% |
| s | 63807 | 8.3% |
| r | 60943 | 7.9% |
| u | 51272 | 6.6% |
| e | 44787 | 5.8% |
| m | 41692 | 5.4% |
| l | 41013 | 5.3% |
| b | 38453 | 5.0% |
| Other values (16) | 194926 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 18761 | |
| C | 9181 | |
| A | 8176 | |
| S | 6424 | 7.6% |
| T | 5387 | 6.4% |
| M | 5100 | 6.0% |
| B | 4333 | 5.1% |
| L | 3486 | 4.1% |
| D | 3473 | 4.1% |
| N | 3241 | 3.8% |
| Other values (16) | 17053 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 32 | |
| ? | 2 | 5.9% |
Space Separator
| Value | Count | Frequency (%) |
| 11 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 856278 | |
| Common | 45 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 89127 | 10.4% |
| a | 81161 | 9.5% |
| i | 64482 | 7.5% |
| s | 63807 | 7.5% |
| r | 60943 | 7.1% |
| u | 51272 | 6.0% |
| e | 44787 | 5.2% |
| m | 41692 | 4.9% |
| l | 41013 | 4.8% |
| b | 38453 | 4.5% |
| Other values (42) | 279541 |
Common
| Value | Count | Frequency (%) |
| . | 32 | |
| 11 | 24.4% | |
| ? | 2 | 4.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 856323 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 89127 | 10.4% |
| a | 81161 | 9.5% |
| i | 64482 | 7.5% |
| s | 63807 | 7.5% |
| r | 60943 | 7.1% |
| u | 51272 | 6.0% |
| e | 44787 | 5.2% |
| m | 41692 | 4.9% |
| l | 41013 | 4.8% |
| b | 38453 | 4.5% |
| Other values (45) | 279586 |
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 3814097 |
| Missing (%) | > 99.9% |
| Memory size | 29.1 MiB |
Length
| Max length | 18 |
|---|---|
| Median length | 16 |
| Mean length | 16 |
| Min length | 14 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Carex maculata |
|---|---|
| 2nd row | Tursiops truncatus |
| Value | Count | Frequency (%) |
| carex | 1 | |
| maculata | 1 | |
| tursiops | 1 | |
| truncatus | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 5 | |
| u | 4 | |
| r | 3 | |
| s | 3 | |
| t | 3 | |
| 2 | 6.2% | |
| c | 2 | 6.2% |
| T | 1 | 3.1% |
| p | 1 | 3.1% |
| o | 1 | 3.1% |
| Other values (7) | 7 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 28 | |
| Space Separator | 2 | 6.2% |
| Uppercase Letter | 2 | 6.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 5 | |
| u | 4 | |
| r | 3 | |
| s | 3 | |
| t | 3 | |
| c | 2 | 7.1% |
| p | 1 | 3.6% |
| o | 1 | 3.6% |
| i | 1 | 3.6% |
| l | 1 | 3.6% |
| Other values (4) | 4 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 1 | |
| C | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 30 | |
| Common | 2 | 6.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 5 | |
| u | 4 | |
| r | 3 | |
| s | 3 | |
| t | 3 | |
| c | 2 | 6.7% |
| T | 1 | 3.3% |
| p | 1 | 3.3% |
| o | 1 | 3.3% |
| i | 1 | 3.3% |
| Other values (6) | 6 |
Common
| Value | Count | Frequency (%) |
| 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 32 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 5 | |
| u | 4 | |
| r | 3 | |
| s | 3 | |
| t | 3 | |
| 2 | 6.2% | |
| c | 2 | 6.2% |
| T | 1 | 3.1% |
| p | 1 | 3.1% |
| o | 1 | 3.1% |
| Other values (7) | 7 |
specificEpithet
Text
Missing 
| Distinct | 136016 |
|---|---|
| Distinct (%) | 3.8% |
| Missing | 190700 |
| Missing (%) | 5.0% |
| Memory size | 29.1 MiB |
Length
| Max length | 32 |
|---|---|
| Median length | 28 |
| Mean length | 8.563882697 |
| Min length | 1 |
Unique
| Unique | 56290 ? |
|---|---|
| Unique (%) | 1.6% |
Sample
| 1st row | lescurii |
|---|---|
| 2nd row | ochrophaeus |
| 3rd row | kinbergi |
| 4th row | adelphus |
| 5th row | catoptrophori |
| Value | Count | Frequency (%) |
| sp | 223489 | 6.2% |
| cinereus | 33845 | 0.9% |
| americana | 8943 | 0.2% |
| gracilis | 8698 | 0.2% |
| canadensis | 7759 | 0.2% |
| maniculatus | 6546 | 0.2% |
| occidentalis | 6486 | 0.2% |
| fuscus | 6485 | 0.2% |
| elegans | 6236 | 0.2% |
| montanus | 6177 | 0.2% |
| Other values (135825) | 3311792 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 3916798 | |
| i | 3443476 | |
| s | 2774878 | 8.9% |
| e | 2213681 | 7.1% |
| r | 2045039 | 6.6% |
| u | 1975526 | 6.4% |
| n | 1922945 | 6.2% |
| l | 1895407 | 6.1% |
| t | 1674637 | 5.4% |
| o | 1656555 | 5.3% |
| Other values (48) | 7511422 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 30788068 | |
| Other Punctuation | 229903 | 0.7% |
| Dash Punctuation | 8742 | < 0.1% |
| Space Separator | 3057 | < 0.1% |
| Decimal Number | 443 | < 0.1% |
| Connector Punctuation | 112 | < 0.1% |
| Open Punctuation | 18 | < 0.1% |
| Close Punctuation | 18 | < 0.1% |
| Math Symbol | 3 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 3916798 | |
| i | 3443476 | |
| s | 2774878 | 9.0% |
| e | 2213681 | 7.2% |
| r | 2045039 | 6.6% |
| u | 1975526 | 6.4% |
| n | 1922945 | 6.2% |
| l | 1895407 | 6.2% |
| t | 1674637 | 5.4% |
| o | 1656555 | 5.4% |
| Other values (20) | 7269126 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 156 | |
| 2 | 61 | 13.8% |
| 0 | 48 | 10.8% |
| 3 | 42 | 9.5% |
| 9 | 37 | 8.4% |
| 4 | 25 | 5.6% |
| 6 | 23 | 5.2% |
| 7 | 20 | 4.5% |
| 8 | 16 | 3.6% |
| 5 | 15 | 3.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 226830 | |
| " | 2914 | 1.3% |
| ' | 58 | < 0.1% |
| / | 51 | < 0.1% |
| ? | 27 | < 0.1% |
| # | 18 | < 0.1% |
| , | 3 | < 0.1% |
| ; | 1 | < 0.1% |
| † | 1 | < 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 16 | |
| [ | 2 | 11.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 16 | |
| ] | 2 | 11.1% |
Math Symbol
| Value | Count | Frequency (%) |
| ~ | 2 | |
| = | 1 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 8742 |
Space Separator
| Value | Count | Frequency (%) |
| 3057 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 112 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 30788068 | |
| Common | 242296 | 0.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 3916798 | |
| i | 3443476 | |
| s | 2774878 | 9.0% |
| e | 2213681 | 7.2% |
| r | 2045039 | 6.6% |
| u | 1975526 | 6.4% |
| n | 1922945 | 6.2% |
| l | 1895407 | 6.2% |
| t | 1674637 | 5.4% |
| o | 1656555 | 5.4% |
| Other values (20) | 7269126 |
Common
| Value | Count | Frequency (%) |
| . | 226830 | |
| - | 8742 | 3.6% |
| 3057 | 1.3% | |
| " | 2914 | 1.2% |
| 1 | 156 | 0.1% |
| _ | 112 | < 0.1% |
| 2 | 61 | < 0.1% |
| ' | 58 | < 0.1% |
| / | 51 | < 0.1% |
| 0 | 48 | < 0.1% |
| Other values (18) | 267 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 31030323 | |
| None | 40 | < 0.1% |
| Punctuation | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 3916798 | |
| i | 3443476 | |
| s | 2774878 | 8.9% |
| e | 2213681 | 7.1% |
| r | 2045039 | 6.6% |
| u | 1975526 | 6.4% |
| n | 1922945 | 6.2% |
| l | 1895407 | 6.1% |
| t | 1674637 | 5.4% |
| o | 1656555 | 5.3% |
| Other values (43) | 7511381 |
None
| Value | Count | Frequency (%) |
| ë | 27 | |
| ü | 10 | 25.0% |
| ñ | 2 | 5.0% |
| æ | 1 | 2.5% |
Punctuation
| Value | Count | Frequency (%) |
| † | 1 |
Missing 
| Distinct | 24030 |
|---|---|
| Distinct (%) | 5.6% |
| Missing | 3381784 |
| Missing (%) | 88.7% |
| Memory size | 29.1 MiB |
Length
| Max length | 33 |
|---|---|
| Median length | 29 |
| Mean length | 8.964754866 |
| Min length | 1 |
Unique
| Unique | 8599 ? |
|---|---|
| Unique (%) | 2.0% |
Sample
| 1st row | cinnamomina |
|---|---|
| 2nd row | berlandieri |
| 3rd row | mellodora |
| 4th row | rubiginosa |
| 5th row | spergulariiforme |
| Value | Count | Frequency (%) |
| noveboracensis | 2209 | 0.5% |
| domesticus | 2097 | 0.5% |
| acuminatum | 1842 | 0.4% |
| pennsylvanicus | 1771 | 0.4% |
| cinereus | 1593 | 0.4% |
| carolinensis | 1550 | 0.4% |
| talpoides | 1538 | 0.4% |
| minor | 1414 | 0.3% |
| occidentalis | 1410 | 0.3% |
| gambelii | 1301 | 0.3% |
| Other values (23958) | 416193 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 453190 | |
| a | 451397 | |
| s | 381313 | |
| e | 296985 | 7.7% |
| n | 268978 | 6.9% |
| r | 258031 | 6.7% |
| u | 250988 | 6.5% |
| l | 227869 | 5.9% |
| o | 216334 | 5.6% |
| c | 196282 | 5.1% |
| Other values (37) | 874231 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3874278 | |
| Space Separator | 603 | < 0.1% |
| Dash Punctuation | 341 | < 0.1% |
| Other Punctuation | 275 | < 0.1% |
| Uppercase Letter | 38 | < 0.1% |
| Open Punctuation | 30 | < 0.1% |
| Close Punctuation | 30 | < 0.1% |
| Math Symbol | 3 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 453190 | |
| a | 451397 | |
| s | 381313 | |
| e | 296985 | 7.7% |
| n | 268978 | 6.9% |
| r | 258031 | 6.7% |
| u | 250988 | 6.5% |
| l | 227869 | 5.9% |
| o | 216334 | 5.6% |
| c | 196282 | 5.1% |
| Other values (18) | 872911 |
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 11 | |
| F | 11 | |
| C | 6 | |
| B | 2 | 5.3% |
| A | 2 | 5.3% |
| O | 2 | 5.3% |
| H | 2 | 5.3% |
| V | 1 | 2.6% |
| D | 1 | 2.6% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 232 | |
| ? | 17 | 6.2% |
| ' | 15 | 5.5% |
| " | 6 | 2.2% |
| / | 5 | 1.8% |
Space Separator
| Value | Count | Frequency (%) |
| 603 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 341 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 30 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 30 |
Math Symbol
| Value | Count | Frequency (%) |
| × | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3874316 | |
| Common | 1282 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 453190 | |
| a | 451397 | |
| s | 381313 | |
| e | 296985 | 7.7% |
| n | 268978 | 6.9% |
| r | 258031 | 6.7% |
| u | 250988 | 6.5% |
| l | 227869 | 5.9% |
| o | 216334 | 5.6% |
| c | 196282 | 5.1% |
| Other values (27) | 872949 |
Common
| Value | Count | Frequency (%) |
| 603 | ||
| - | 341 | |
| . | 232 | 18.1% |
| ( | 30 | 2.3% |
| ) | 30 | 2.3% |
| ? | 17 | 1.3% |
| ' | 15 | 1.2% |
| " | 6 | 0.5% |
| / | 5 | 0.4% |
| × | 3 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3875593 | |
| None | 5 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 453190 | |
| a | 451397 | |
| s | 381313 | |
| e | 296985 | 7.7% |
| n | 268978 | 6.9% |
| r | 258031 | 6.7% |
| u | 250988 | 6.5% |
| l | 227869 | 5.9% |
| o | 216334 | 5.6% |
| c | 196282 | 5.1% |
| Other values (34) | 874226 |
None
| Value | Count | Frequency (%) |
| × | 3 | |
| ß | 1 | 20.0% |
| ë | 1 | 20.0% |
taxonRank
Text
Missing 
| Distinct | 34 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 3381907 |
| Missing (%) | 88.7% |
| Memory size | 29.1 MiB |
Length
| Max length | 17 |
|---|---|
| Median length | 10 |
| Mean length | 9.351105527 |
| Min length | 2 |
Unique
| Unique | 5 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | subspecies |
|---|---|
| 2nd row | subspecies |
| 3rd row | variety |
| 4th row | subspecies |
| 5th row | subspecies |
| Value | Count | Frequency (%) |
| subspecies | 341855 | |
| variety | 85773 | 19.8% |
| forma | 3328 | 0.8% |
| var | 898 | 0.2% |
| form | 78 | < 0.1% |
| aberration | 71 | < 0.1% |
| race | 33 | < 0.1% |
| subvariety | 32 | < 0.1% |
| aff | 31 | < 0.1% |
| nothosubsp | 26 | < 0.1% |
| Other values (19) | 72 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| s | 1025667 | |
| e | 769648 | |
| i | 427740 | |
| b | 341987 | 8.5% |
| u | 341923 | 8.5% |
| p | 341908 | 8.5% |
| c | 341902 | 8.5% |
| r | 90283 | 2.2% |
| a | 90204 | 2.2% |
| t | 85928 | 2.1% |
| Other values (22) | 184283 | 4.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4028226 | |
| Uppercase Letter | 12269 | 0.3% |
| Other Punctuation | 965 | < 0.1% |
| Space Separator | 5 | < 0.1% |
| Open Punctuation | 4 | < 0.1% |
| Close Punctuation | 4 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 1025667 | |
| e | 769648 | |
| i | 427740 | |
| b | 341987 | 8.5% |
| u | 341923 | 8.5% |
| p | 341908 | 8.5% |
| c | 341902 | 8.5% |
| r | 90283 | 2.2% |
| a | 90204 | 2.2% |
| t | 85928 | 2.1% |
| Other values (11) | 171036 | 4.2% |
Uppercase Letter
| Value | Count | Frequency (%) |
| V | 12023 | |
| F | 132 | 1.1% |
| A | 68 | 0.6% |
| R | 29 | 0.2% |
| M | 9 | 0.1% |
| U | 4 | < 0.1% |
| C | 4 | < 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 965 |
Space Separator
| Value | Count | Frequency (%) |
| 5 |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 4 |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 4 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4040495 | |
| Common | 978 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| s | 1025667 | |
| e | 769648 | |
| i | 427740 | |
| b | 341987 | 8.5% |
| u | 341923 | 8.5% |
| p | 341908 | 8.5% |
| c | 341902 | 8.5% |
| r | 90283 | 2.2% |
| a | 90204 | 2.2% |
| t | 85928 | 2.1% |
| Other values (18) | 183305 | 4.5% |
Common
| Value | Count | Frequency (%) |
| . | 965 | |
| 5 | 0.5% | |
| [ | 4 | 0.4% |
| ] | 4 | 0.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4041473 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| s | 1025667 | |
| e | 769648 | |
| i | 427740 | |
| b | 341987 | 8.5% |
| u | 341923 | 8.5% |
| p | 341908 | 8.5% |
| c | 341902 | 8.5% |
| r | 90283 | 2.2% |
| a | 90204 | 2.2% |
| t | 85928 | 2.1% |
| Other values (22) | 184283 | 4.6% |
Missing 
| Distinct | 66565 |
|---|---|
| Distinct (%) | 2.8% |
| Missing | 1431500 |
| Missing (%) | 37.5% |
| Memory size | 29.1 MiB |
Length
| Max length | 255 |
|---|---|
| Median length | 65 |
| Mean length | 10.72469308 |
| Min length | 2 |
Unique
| Unique | 18307 ? |
|---|---|
| Unique (%) | 0.8% |
Sample
| 1st row | (A. Gray) S. Watson |
|---|---|
| 2nd row | Ehlers |
| 3rd row | Selys |
| 4th row | Badley |
| 5th row | Paulson |
| Value | Count | Frequency (%) |
| 275794 | 6.1% | |
| l | 269565 | 6.0% |
| ex | 120679 | 2.7% |
| a | 76166 | 1.7% |
| dc | 56611 | 1.3% |
| gray | 48180 | 1.1% |
| kunth | 44183 | 1.0% |
| linnaeus | 41860 | 0.9% |
| benth | 41199 | 0.9% |
| sw | 36382 | 0.8% |
| Other values (17753) | 3505839 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 2374881 | 9.3% |
| 2133859 | 8.4% | |
| e | 1748727 | 6.8% |
| r | 1301606 | 5.1% |
| a | 1260238 | 4.9% |
| n | 1150287 | 4.5% |
| l | 1072305 | 4.2% |
| ( | 1016637 | 4.0% |
| ) | 1016637 | 4.0% |
| i | 905553 | 3.5% |
| Other values (102) | 11571913 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 14358899 | |
| Uppercase Letter | 4346097 | 17.0% |
| Other Punctuation | 2659953 | 10.4% |
| Space Separator | 2133859 | 8.4% |
| Open Punctuation | 1016637 | 4.0% |
| Close Punctuation | 1016637 | 4.0% |
| Dash Punctuation | 20561 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1748727 | |
| r | 1301606 | 9.1% |
| a | 1260238 | 8.8% |
| n | 1150287 | 8.0% |
| l | 1072305 | 7.5% |
| i | 905553 | 6.3% |
| o | 904808 | 6.3% |
| t | 796053 | 5.5% |
| s | 739601 | 5.2% |
| u | 621804 | 4.3% |
| Other values (54) | 3857917 |
Uppercase Letter
| Value | Count | Frequency (%) |
| L | 513452 | |
| S | 434070 | 10.0% |
| B | 332546 | 7.7% |
| H | 313612 | 7.2% |
| M | 311417 | 7.2% |
| C | 300680 | 6.9% |
| R | 234805 | 5.4% |
| A | 223211 | 5.1% |
| G | 216969 | 5.0% |
| D | 209797 | 4.8% |
| Other values (27) | 1255538 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 2374881 | |
| & | 276039 | 10.4% |
| ' | 6061 | 0.2% |
| , | 1799 | 0.1% |
| \ | 1165 | < 0.1% |
| ? | 5 | < 0.1% |
| ; | 3 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 2133859 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1016637 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1016637 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 20561 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 18704996 | |
| Common | 6847647 | 26.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1748727 | 9.3% |
| r | 1301606 | 7.0% |
| a | 1260238 | 6.7% |
| n | 1150287 | 6.1% |
| l | 1072305 | 5.7% |
| i | 905553 | 4.8% |
| o | 904808 | 4.8% |
| t | 796053 | 4.3% |
| s | 739601 | 4.0% |
| u | 621804 | 3.3% |
| Other values (91) | 8204014 |
Common
| Value | Count | Frequency (%) |
| . | 2374881 | |
| 2133859 | ||
| ( | 1016637 | |
| ) | 1016637 | |
| & | 276039 | 4.0% |
| - | 20561 | 0.3% |
| ' | 6061 | 0.1% |
| , | 1799 | < 0.1% |
| \ | 1165 | < 0.1% |
| ? | 5 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 25457851 | |
| None | 94792 | 0.4% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 2374881 | 9.3% |
| 2133859 | 8.4% | |
| e | 1748727 | 6.9% |
| r | 1301606 | 5.1% |
| a | 1260238 | 5.0% |
| n | 1150287 | 4.5% |
| l | 1072305 | 4.2% |
| ( | 1016637 | 4.0% |
| ) | 1016637 | 4.0% |
| i | 905553 | 3.6% |
| Other values (53) | 11477121 |
None
| Value | Count | Frequency (%) |
| ü | 33352 | |
| é | 18990 | |
| ö | 11902 | 12.6% |
| è | 8126 | 8.6% |
| ä | 4210 | 4.4% |
| á | 3601 | 3.8% |
| Á | 3155 | 3.3% |
| ø | 2575 | 2.7% |
| ó | 1563 | 1.6% |
| Ø | 1268 | 1.3% |
| Other values (39) | 6050 | 6.4% |
vernacularName
Text
Missing 
| Distinct | 3 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 3814096 |
| Missing (%) | > 99.9% |
| Memory size | 29.1 MiB |
Length
| Max length | 84 |
|---|---|
| Median length | 57 |
| Mean length | 49.66666667 |
| Min length | 8 |
Unique
| Unique | 3 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Plantae, Monocotyledonae, Poales, Cyperaceae, Cyperoideae |
|---|---|
| 2nd row | Holotype |
| 3rd row | Animalia, Chordata, Vertebrata, Mammalia, Eutheria, Cetacea, Odontoceti, Delphinidae |
| Value | Count | Frequency (%) |
| plantae | 1 | 7.1% |
| monocotyledonae | 1 | 7.1% |
| poales | 1 | 7.1% |
| cyperaceae | 1 | 7.1% |
| cyperoideae | 1 | 7.1% |
| holotype | 1 | 7.1% |
| animalia | 1 | 7.1% |
| chordata | 1 | 7.1% |
| vertebrata | 1 | 7.1% |
| mammalia | 1 | 7.1% |
| Other values (4) | 4 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 20 | |
| e | 19 | |
| , | 11 | 7.4% |
| 11 | 7.4% | |
| o | 11 | 7.4% |
| t | 10 | 6.7% |
| i | 8 | 5.4% |
| l | 7 | 4.7% |
| r | 6 | 4.0% |
| n | 6 | 4.0% |
| Other values (18) | 40 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 113 | |
| Uppercase Letter | 14 | 9.4% |
| Other Punctuation | 11 | 7.4% |
| Space Separator | 11 | 7.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 20 | |
| e | 19 | |
| o | 11 | |
| t | 10 | |
| i | 8 | 7.1% |
| l | 7 | 6.2% |
| r | 6 | 5.3% |
| n | 6 | 5.3% |
| d | 5 | 4.4% |
| p | 4 | 3.5% |
| Other values (7) | 17 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 4 | |
| P | 2 | |
| M | 2 | |
| H | 1 | 7.1% |
| A | 1 | 7.1% |
| V | 1 | 7.1% |
| E | 1 | 7.1% |
| O | 1 | 7.1% |
| D | 1 | 7.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 11 |
Space Separator
| Value | Count | Frequency (%) |
| 11 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 127 | |
| Common | 22 | 14.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 20 | |
| e | 19 | |
| o | 11 | 8.7% |
| t | 10 | 7.9% |
| i | 8 | 6.3% |
| l | 7 | 5.5% |
| r | 6 | 4.7% |
| n | 6 | 4.7% |
| d | 5 | 3.9% |
| p | 4 | 3.1% |
| Other values (16) | 31 |
Common
| Value | Count | Frequency (%) |
| , | 11 | |
| 11 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 149 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 20 | |
| e | 19 | |
| , | 11 | 7.4% |
| 11 | 7.4% | |
| o | 11 | 7.4% |
| t | 10 | 6.7% |
| i | 8 | 5.4% |
| l | 7 | 4.7% |
| r | 6 | 4.0% |
| n | 6 | 4.0% |
| Other values (18) | 40 |
Missing 
| Distinct | 5 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 3814094 |
| Missing (%) | > 99.9% |
| Memory size | 29.1 MiB |
Length
| Max length | 16 |
|---|---|
| Median length | 11 |
| Mean length | 11.4 |
| Min length | 7 |
Unique
| Unique | 5 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Cram, E. B. |
|---|---|
| 2nd row | Howell, Tiffany |
| 3rd row | Plantae |
| 4th row | Maccallum, G. A. |
| 5th row | Animalia |
| Value | Count | Frequency (%) |
| cram | 1 | |
| e | 1 | |
| b | 1 | |
| howell | 1 | |
| tiffany | 1 | |
| plantae | 1 | |
| maccallum | 1 | |
| g | 1 | |
| a | 1 | |
| animalia | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 8 | |
| l | 6 | 10.5% |
| 5 | 8.8% | |
| . | 4 | 7.0% |
| m | 3 | 5.3% |
| , | 3 | 5.3% |
| n | 3 | 5.3% |
| i | 3 | 5.3% |
| e | 2 | 3.5% |
| c | 2 | 3.5% |
| Other values (16) | 18 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 35 | |
| Uppercase Letter | 10 | 17.5% |
| Other Punctuation | 7 | 12.3% |
| Space Separator | 5 | 8.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 8 | |
| l | 6 | |
| m | 3 | 8.6% |
| n | 3 | 8.6% |
| i | 3 | 8.6% |
| e | 2 | 5.7% |
| c | 2 | 5.7% |
| f | 2 | 5.7% |
| w | 1 | 2.9% |
| r | 1 | 2.9% |
| Other values (4) | 4 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 2 | |
| T | 1 | |
| H | 1 | |
| B | 1 | |
| P | 1 | |
| M | 1 | |
| E | 1 | |
| G | 1 | |
| C | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 4 | |
| , | 3 |
Space Separator
| Value | Count | Frequency (%) |
| 5 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 45 | |
| Common | 12 | 21.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 8 | |
| l | 6 | |
| m | 3 | 6.7% |
| n | 3 | 6.7% |
| i | 3 | 6.7% |
| e | 2 | 4.4% |
| c | 2 | 4.4% |
| f | 2 | 4.4% |
| A | 2 | 4.4% |
| w | 1 | 2.2% |
| Other values (13) | 13 |
Common
| Value | Count | Frequency (%) |
| 5 | ||
| . | 4 | |
| , | 3 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 57 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 8 | |
| l | 6 | 10.5% |
| 5 | 8.8% | |
| . | 4 | 7.0% |
| m | 3 | 5.3% |
| , | 3 | 5.3% |
| n | 3 | 5.3% |
| i | 3 | 5.3% |
| e | 2 | 3.5% |
| c | 2 | 3.5% |
| Other values (16) | 18 |
taxonomicStatus
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 3814098 |
| Missing (%) | > 99.9% |
| Memory size | 29.1 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 8 |
| Min length | 8 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Chordata |
|---|
| Value | Count | Frequency (%) |
| chordata | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 2 | |
| C | 1 | |
| h | 1 | |
| o | 1 | |
| r | 1 | |
| d | 1 | |
| t | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7 | |
| Uppercase Letter | 1 | 12.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 2 | |
| h | 1 | |
| o | 1 | |
| r | 1 | |
| d | 1 | |
| t | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 2 | |
| C | 1 | |
| h | 1 | |
| o | 1 | |
| r | 1 | |
| d | 1 | |
| t | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 2 | |
| C | 1 | |
| h | 1 | |
| o | 1 | |
| r | 1 | |
| d | 1 | |
| t | 1 |
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 3814097 |
| Missing (%) | > 99.9% |
| Memory size | 29.1 MiB |
Length
| Max length | 15 |
|---|---|
| Median length | 11.5 |
| Mean length | 11.5 |
| Min length | 8 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Monocotyledonae |
|---|---|
| 2nd row | Mammalia |
| Value | Count | Frequency (%) |
| monocotyledonae | 1 | |
| mammalia | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 4 | |
| a | 4 | |
| M | 2 | |
| n | 2 | |
| l | 2 | |
| e | 2 | |
| m | 2 | |
| c | 1 | 4.3% |
| t | 1 | 4.3% |
| y | 1 | 4.3% |
| Other values (2) | 2 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 21 | |
| Uppercase Letter | 2 | 8.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 4 | |
| a | 4 | |
| n | 2 | |
| l | 2 | |
| e | 2 | |
| m | 2 | |
| c | 1 | 4.8% |
| t | 1 | 4.8% |
| y | 1 | 4.8% |
| d | 1 | 4.8% |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 23 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 4 | |
| a | 4 | |
| M | 2 | |
| n | 2 | |
| l | 2 | |
| e | 2 | |
| m | 2 | |
| c | 1 | 4.3% |
| t | 1 | 4.3% |
| y | 1 | 4.3% |
| Other values (2) | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 23 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 4 | |
| a | 4 | |
| M | 2 | |
| n | 2 | |
| l | 2 | |
| e | 2 | |
| m | 2 | |
| c | 1 | 4.3% |
| t | 1 | 4.3% |
| y | 1 | 4.3% |
| Other values (2) | 2 |
taxonRemarks
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 3814097 |
| Missing (%) | > 99.9% |
| Memory size | 29.1 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 6.5 |
| Mean length | 6.5 |
| Min length | 6 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Poales |
|---|---|
| 2nd row | Cetacea |
| Value | Count | Frequency (%) |
| poales | 1 | |
| cetacea | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 3 | |
| e | 3 | |
| P | 1 | 7.7% |
| o | 1 | 7.7% |
| l | 1 | 7.7% |
| s | 1 | 7.7% |
| C | 1 | 7.7% |
| t | 1 | 7.7% |
| c | 1 | 7.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 11 | |
| Uppercase Letter | 2 | 15.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 3 | |
| e | 3 | |
| o | 1 | 9.1% |
| l | 1 | 9.1% |
| s | 1 | 9.1% |
| t | 1 | 9.1% |
| c | 1 | 9.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 1 | |
| C | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 13 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 3 | |
| e | 3 | |
| P | 1 | 7.7% |
| o | 1 | 7.7% |
| l | 1 | 7.7% |
| s | 1 | 7.7% |
| C | 1 | 7.7% |
| t | 1 | 7.7% |
| c | 1 | 7.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 13 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 3 | |
| e | 3 | |
| P | 1 | 7.7% |
| o | 1 | 7.7% |
| l | 1 | 7.7% |
| s | 1 | 7.7% |
| C | 1 | 7.7% |
| t | 1 | 7.7% |
| c | 1 | 7.7% |